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FOREWORD 


This RECORD will be of interest to those using conventional transportation forecasting 
techniques. The papers cover a wide variety of topics but emphasize improvements 
to procedures in the areas of trip-generation, modal-choice, and traffic-assignment 
techniques. Forecasting for recreational travel and pedestrian flow and for new urban 
transportation systems is also covered. 

It is important to note that attention is again being paid to an important and some- 
what neglected area: assigning travel to a network of facilities. Regardless of the 
methods used to obtain travel demand forecasts, the ultimate use of such forecasts is 
to assist responsible agencies in facility planning, evaluation, and design. The assign- 
ment of demand to a facility is the last and perhaps the most important step in this 
process. Multipath, multivariable traces and incremental loading techniques are dis- 
cussed. In addition, direct assignment methods are evaluated. 

The importance of including the effects of the transportation system in estimates of 
trip-making potential is covered in several papers. One researcher concludes that 
provision of a high level of highway service does not increase the use of automobiles, 
but he points out that there is a lower use of automobiles in areas of high density and 
good transit service. Others emphasize the importance of including accessibility vari- 
ables in estimating person-trip generation. Both findings, then, indicate the importance 
of including transit system variables in trip-generation analysis. These findings point 
the way to significant improvements that can be made in present travel-forecasting 
models and procedures. 


—George V. Wickstrom 


MODAL-CHOICE MODEL FOR RELATING DEMAND 
TO INVESTMENT 


Salvatore J. Bellomo, Christopher G. Turner, and Denis K. Johnston, 
Alan M. Voorhees and Associates, Inc. 


The development of a macromodel for modal choice is presented. The 
model relates investment to transit supply, supply to level of service, and 
level of service to demand. The model is developed at a city-wide level by 
using a simulation technique involving the analysis of 13 individual highway- 
transit system tests for a city of 3.5 million people. Land use activities 
are allocated on the basis of accessibility provided by both highway and 
transit systems. The generation of travel is sensitive to the level of ser- 
vice provided, and the distribution of trips is achieved by using weighted 
highway-transit skim trees and a standard gravity model. The modal- 
choice analysis employs a utilitarian model develpedas part of a study of 
models calibrated for existing cities. The macromodel relates demand to 
mean travel-time difference and mean travel-cost difference between tran- 
sit and highway for work and nonwork purposes for both peak and off-peak 
periods. The mean city-wide transit travel time is related to the supply 
of seat-miles of service per capita, and the highway travel time is related 
to capacity-miles of highway per capita. Transit supply is in turn related 
to the capital and operating costs of providing that service. In application, 
the model assumes a fixed level of highway supply and has as policy vari- 
ables the absolute investment level in transit, the split of investment be- 
tween bus or rail rapid transit and conventional bus transit, the transit 
fare, the split of service between peak and off-peak periods, and the park- 
ing cost, 


eTHERE IS a critical need in the urban transportation planning process for an analyt- 
ical capability to permit the testing of multimodal and multiregional transportation- 
investment policies. In recognition of this need, the U.S Department of Transportation 
sponsored a series of studies aimed at improving that capability. One of these studies 
was the Transportation Resource Allocation Study that considered the quantity and the 
quality of the transportation system as well as the basic indicators of socioeconomic 
characteristics such as population and automobile ownership. In an earlier report, 
Kassoff and Gendell (1) described a process and travel-demand forecasting procedure 
to project future urbanized-area travel. This work has been further expanded in En- 
gland by Lesley (2), who has attempted to establish a relation between the structure 
and operating performance of urban public transportation systems and the macroparam- 
eters of European cities. 

This paper is concerned with the development of a modal-choice model that can be 
used to relate demand to investment on an urbanized-area basis. The model provides 
the planner with a simple analytical tool to estimate transit investment for a given 
demand, estimate transit demand for a given investment, or estimate the level of tran- 
sit service for a given demand or investment. 

In the development of this macromodel, a simulation approach was used. That is, 
the basic relations have been established by simulating the urban-activity characteris- 
tics for 2 large metropolitan areas, and an experimental design was developed to sketch 
feasible combinations of highway and transit levels of service for both areas based on 
the known theory of travel demand and urban activity. The reasons for adopting a sim- 
ulation approach are as follows: 


1. Supplying an adequate and consistent empirical data base across several urban- 
ized areas is difficult; 

2. Certain variables of concern (e.g., high levels of transit service) do not cur- 
rently exist on a city-wide basis; 

3. Sensitivities that emerge on a macrobasis can only be established through a 
microsimulation and then a city-wide summary; and 

4. Interrelations among land use, transit, highway, and other policies for a metro- 
politan area make sensitivity testing very difficult. 


EXPERIMENTAL DESIGN 


The experimental design developed for the study is shown in Figure 1. The number 
of transportation alternatives (simulation tests) was set at 13, and the figure shows how 
these alternatives were developed to cover a wide range of highway and transit system 
service levels. 

In each case, the design of the system was based on the characteristics of systems 
operating in, or proposed for, existing cities. A mix of transit systems is implied by 
the categories of conventional bus transit, bus rapid transit, and rail rapid transit 
because the operation of each system is dependent on supporting feeder and distribution 
systems, 

Two base-year urban-activity patterns were developed in the study to simulate a 
range of existing cities. Specifically, patterns of central-city activity concentration 
and suburban-activity dispersion were tested. From an analysis of existing cities, each 
transportation system was matched with the urban-activity pattern to which it was most 
closely related, and as a result the 13 tests shown in Figure 1 were selected. 


OPERATIONAL PROCEDURE 


To provide inputs to the macromodel, an operational procedure was developed and 
applied to each of the 13 simulation tests shown in Figure 1. This procedure is shown 
in Figure 2, and basically it has involved the following major components, 


1. The development of a procedure enabling the prediction of microutility modal- 
choice curve characteristics (3, 4) for a city, given a set of macroparameters describ- 
that city, involved a research study of the characteristics of existing microutility 
curves in order to relate those characteristics to the macroparameters of the city for 
which each model was developed. (For the aggregate transit percentages output from 
the study to be realistic, a modal-choice model that accurately reproduced the trip- 
maker's modal-choice decisions had to be utilized for each simulation test. Such mod- 
els are usually developed and calibrated for individual cities on the basis of trip- 
interchange data. Because such data are not available for the simulated cities, it 
was necessary to develop a relation between the characteristics of microutility modal- 
choice models and aggregate city parameters.) A relation was developed between the 
microutility modal-choice curves calibrated for Los Angeles, Twin Cities, and New- 
castle (England) and the mean city-wide automobile ownership stratified by the level of 
automobile ownership (5). The resulting family of curves provides the basis on which 
the microutility curve for different cities can be estimated directly from a knowledge 
of the mean automobile-ownership rate in the city. 

The model is essentially a 3-dimensional surface, as shown in Figure 3, for each 
level of automobile ownership. However, because only mean automobile-ownership 
rates were predicted for the simulated cities, the model was aggregated over levels of 
automobile ownership for use in the study by using a unique relation between mean auto- 
mobile ownership and the percentage of households in each ownership group. 

2. The development of realistic urban-activity patterns and transportation system 
alternatives to use as input to the simulation study was achieved from an extensive 
analysis of existing cities and resulted in the development of 2 base-year distributions 
of population and employment representing a highly concentrated city typical of older 
cities and a more dispersed automobile-oriented city. Those cities were matched 
against the transportation systems as shown in Figure 1. Base-year distributions of 


Figure 1. Experimental design. 
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Figure 3. Three-dimensional surface of 
micromodel for modal choice. 


Figure 2. Operational procedure for development of 
variables for macromodel for modal choice. 
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Table 1. City-wide variables generated by each simulation test for input to macromodel. 


System Characteristics 


Component Highway Transit Urban Activity Characteristics 
Demand* Number of trips Percentage of transit work trips Population and population den- 
Vehicle-miles Percentage of transit nonwork trips sity (CBD, central city, and 
Percentage of transit total trips suburbs) 
Passenger-miles Manufacturing employment and 
. manufacturing employment 
Devel at x Mean gus, Pate Mears walk fg density (CBD, central city, and 
service Mean travel cost Mean wait time suburbs) 
reas arta anu ee he eis Office employment and office 
Ee Ses tS cere employment density (CBD, 
Mean speed Mean speed tral cit A antasctual 
Mean trip length Mean trip length Bete CHU pane. Bure 
Population following employment 
Supply Arterial miles Fleet size” (number of buses and number of and population following em~ 


Freeway miles 
Highway miles 
Arterial lane-miles 
Freeway lane-miles 
Highway lane-miles 
Canacity-miles 


Investment 


rail cars) 
Miles of private right-of-way 
Route-miles 
Number of rapid transit stations 
Seat-miles* 


Capital costs (guideway and rolling stock) 
Daily operating costs (labor and other) 


ployment density (CBD, central 
city, suburbs) 

Mean income 

Mean automobile ownership 


“Stratified by peak and off-peak period 


4 


mean zonal income, automobile ownership, parking costs, and transit fares were es- 
tablished for both cities. [The distibution of mean zonal income was calculated from 
1968 in-house data for Cleveland (6). Automobile-ownership rates were calculated 
from a relation among income, transit level of service, and automobile ownership 
developed for Memphis, Tennessee. Base-year zonal parking costs were developed 
from a procedure used in the Washington Metropolitan Area Transit Authority study 
(7), and zonal transit fares were developed from in-house data and library research 
material. | 

3. The application of urban-activity, trip-generation, trip-distribution, modal- 
choice, and trip-assignment models to each simulation test developed inputs to the 
macromodel. Initially, distributions of population, employment by type, and income 
were allocated to the simulated city zones to produce the base-year patterns of con- 
centrated and dispersed urban activity (which in each case had a base-year population 
of 2.5 million and a labor force of 900,000). At the same time, the 13 simulation tests 
shown in Figure 1 were devised, and the respective highway and transit systems were 
developed. Peak and off-peak highway and transit skim trees were then built for each 
test, and a composite skim tree was developed and weighted in terms of work and non- 
work trips, peak and off-peak trips, and highway and transit trips. The projected city- 
wide increases in population and employment by type were then distributed to zones on 
this composite skim tree by using a simple activity-allocation model (8). [A projec- 
tion period of 20 years from a 1970 base year was assumed in the study. Both simu- 
lated cities were assumed to have the same growth in population and employment, and 
growth was based on the mean projections for the 40 largest urbanized areas in the 
United States during the period 1970-1990 (8).] 

It was also necessary to project the zonal increase in income and automobile owner- 
ship. Inthe case of income, however, a literature survey indicated that little infor- 
mation was available on changes in the spatial distribution of income over time. A 
simple model was, therefore, developed to allocate the total city-wide increase in 
income to zones. The mean city-wide increase in income was based on the Cleveland 
study projections (8), and this total was allocated to zones in proportion to each zone's 
population growth during the projection period relative to the city-wide population 
growth and to each zone's base-year mean income relative to the city-wide base-year 
mean income. 

Automobile-ownership rates for the projection were developed by applying the 1990 
zonal income projections to the base-year relation established among income, transit 
level of service, and automobile ownership. 

The 1990 distributions of activity and socioeconomic characteristics were then used 
to forecast trip-generation rates [which were sensitive to the level of service provided 
(9)] for home-based work, home-based nonwork, and non-home-based trips. So that 
they would be consistent with the 1990 projected distributions of activity, socioeconomic 
characteristics, parking costs, and transit fares, the composite skim trees were up- 
dated. The 1990 trip productions and attractions, composite skim trees, and F-factor 
curves (10) were used, and the gravity model was run for each of the following trip 
purposes associated with each test: home-based work, home-based nonwork, and non- 
home-based. 

Trip tables were then combined into the categories of work and nonwork trips and 
factored to provide peak and off-peak trip tables. The resulting 4 tables were split 
into automobile and transit trip tables by using the micromodel described previously 
and shown in Figure 3, and the trips were loaded onto the highway and transit networks. 

The reasonableness of the assumptions made about the volume to capacity relations 
was tested by using a standard equilibrium approach to check the output highway speeds 
against the initially assumed highway speeds using capacity restraint. If the output 
speeds were significantly at variance with the initially assumed speeds, the output 
speeds were adopted, and the entire procedure was reiterated for that particular simu- 
lation test. If the initially assumed speeds were within an acceptable range of the out- 
put speeds, the highway assignments were adopted. 

Transit trips were then assigned to the transit network; for those tests having more 
than 1 transit mode, the assignment to a specific mode was made on the basis of rela- 
tive modal capacity. 


4. The parameters describing each simulated city test were spatially aggregated, 
and the relations between those parameters and the city-wide percentage of transit usage 
were developed after the simulation tests were complete. Table 1 gives the parameters 
that were generated and considered as possible variables for the macromodel. 


THE MACROMODEL FOR MODAL CHOICE 


The variables from the simulation tests given in Table 1 were used to develop city- 
wide relations of transit supply and investment, level of service and supply, and demand 
and level of service. Alternative combinations of variables were tried for each of the 
3 components of the macromodel in an attempt to develop a model that was both concep- 
tually sound and statistically accurate. The final relations established for each of the 
components of the model are now presented. 


Supply and Level of Investment 


Figures 4 and 5 show the relations that were developed between transit supply and 
level of investment. Transit supply (measured in terms of rapid transit seat-miles) 
is highly correlated with both capital costs and operating costs on a per capita basis. 
The number of seats per vehicle was based on the characteristics of existing transit 
systems (11), and it was estimated that each bus had 50 seats, each rail car had 80seats, 
and each train had 6 rail cars in the peak period and 3 in the off-peak period. 

For each system test, capital costs were developed for both rolling stock and guide- 
ways by using 1968 Washington Metropolitan Area Transit Authority unit cost figures 
(12). Guideway costs were made up of the type of line construction (e.g., subway, ar- 
terial, or grade construction for rail systems) and of station costs, which included the 
cost of providing parking facilities. Rolling-stock costs were calculated per vehicle 
for buses and rail cars, and operating costs were calculated (from the same source as 
capital costs) for conventional buses, rapid transit buses, feeder buses, and rapid tran- 
sit rail cars on a daily basis. 


Level of Service and Supply 


Because the demand component of the macromodel was developed to deal with both 
peak- and off-peak-period travel, it was necessary to develop peak and off-peak rela- 
tions between the level of service and supply variables. (Peak and off-peak level-of- 
service measures were then input into the demand component of the model.) 

Figure 6 shows the peak-period model developed to relate transit-travel time to rapid 
transit seat-miles per capita. As shown in the figure, there are distinct relations for 
bus-oriented systems and rail-oriented systems. The model indicates that the level of 
service provided by rapid transit bus-oriented systems is very sensitive to changes in 
the level of supply over a comparatively narrow range of supply. In contrast, the model 
shows that the changes that canbe brought about in the level of service for rail-oriented 
systems by varying the level of supply are of a small magnitude, although the supply of 
rapid transit seat-miles is greater and covers a wider range. Figures 4 and 5 show that 
both capital and operating costs of rail-oriented systems tested are higher than those of 
bus-oriented systems. The implication is, therefore, that, for a given increase in in- 
vestment, the bus-oriented systems are more cost effective in terms of providing an 
improved peak-period level of service than are rail-oriented systems. (This conclusion 
relates to the experiment described in this paper. It is not meant as a general conclu- 
sion for all metropolitan areas.) 

For the off-peak period, little correlation was found to exist between off-peak level 
of service and supply variables in absolute terms. However, when a ratio relation was 
developed, a strong correlation was found to exist between the off-peak to peak ratio of 
transit-travel time and the off-peak to peak ratio of total system seat-miles per hour. 
This model, stratified by highway level of service (expressed in terms of capacity-miles 
of highway per capita), is shown in Figure 7. The stratification by highway level of ser- 
vice is significant. It implies that different levels of off-peak service can be provided 
for a given level of off-peak supply, depending on the quality of the highway system. 
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Figure 4. Rapid transit seat-miles per capita versus capital costs per 
capita. 
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Figure 5. Rapid transit seat-miles per capita versus daily operating costs 
per capita. 
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Figure 6. Peak-period transit travel time versus peak-period rapid transit seat-miles per capita. 
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Figure 7. Ratio of off-peak to peak-period travel time versus ratio of off-peak to peak-period seat-miles per 
hour. 
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Perhaps more important, it indicates that, to provide the same level of off-peak service, 
cities with poorer highway systems require a higher level of supply than cities with good 
highway systems. Third, the relation also makes explicit the effect of different levels 
of highway investment (and hence supply) on the existing level of off-peak transit service. 

Inasmuch as the demand component of the macromodel was based ona level-of-service 
difference variable (transit minus highway) for both peak and off-peak periods, relations 
were developed to provide the highway level of service inputs to the demand model. 
These relations are shown in Figures 8 and 9 for the peak and off-peak periods respec - 
tively. In each case, highway-travel time was found to be highly correlated with the 
supply of capacity-miles of highway per capita. 


Demand and Level of Service 


Of the alternative combinations of macroparameters that were used in the develop- 
ment of a relation between transit demand and level of service, it was found that travel- 
time and travel-cost differences (transit minus highway) best accounted for the varia- 
tions in transit demand. Figures 10, 11, 12, and 13 show the 4 demand models developed 
to estimate transit usage for work and nonwork purposes in the peak and off-peak periods. 
The initial relations were developed at a 10-cent cost difference, and the sensitivity of 
demand was later tested against the range of cost differences shown in these 4 figures. 

The relations developed here between demand and level of service are significant for 
several reasons. For a given travel-cost difference, the demand models make it pos- 
sible to identify the effects that changes in transit-travel time (brought about by specify- 
ing different levels of investment and mixes of transit supply) have on demand. These 
effects can be stratified by time of day (peak and off-peak) and by trip purpose (work 
and nonwork). Similarly, the effects that changes in transit-travel cost have on demand 
can also be identified for a given travel-time difference. This is achieved by varying 
the transit-fare policy, and again the effects would be stratified by time of day and type 
of trip. 

Alternatively, the demand models could indicate the effects that different levels of 
highway supply (measured in terms of changes in highway travel time) and highway cost 
(measured in terms of changes in parking cost policies) have on demand. 


IMPLICATIONS 


The macromodel for modal choice presented in this paper would have considerable 
application in multimodal and multiregional transportation-investment planning. It can 
be utilized to test alternative transit-investment policies across cities. Fora given 
level of investment, a measure of supply in terms of rapid transit seat-miles per capita 
can be used to determine the level of transit service for that level of investment. From 
a prespecified highway supply, the highway level of service can be determined. Knowing 
both the highway and transit levels of service, one can estimate the percentage of tran- 
sit trips. 

The model can also be applied incrementally to predict the likely increase in transit 
patronage that would result from alternative investment policies for existing systems 
or to estimate the usage that would result in investing in new systems. It, therefore, 
provides a capability and analytical link that previously required a considerable elapsed 
time and computer cost to obtain. It is not a panacea, however, and should be consid- 
ered as an initial rather than a final screening process when a wide range of transit- 
investment policies is examined. 

The components of the model offer flexibility to the analyst. For a constant invest- 
ment, for example, differing systems mixes of total, peak, and off-peak rail rapid and 
bus rapid transit supply can be specified, and the sensitivity of total transit demand can 
be ascertained. In addition, the model allows alternative fare and parking policies to 
be examined. 

The user, however, should be careful that results are properly interpreted in anal- 
ysis of transit-investment policies. The model is a guide for checking aggregate de- 
mand, supply, and level of service characteristics. It is not a substitute for micro- 
analyses that are needed to determine the economic, social, and environmental effects 
of transit-investment policies for different groups of the regional community. 


Figure 8. Peak-period highway travel time versus highway capacity-miles per 
capita. 
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Figure 9. Off-peak-period highway travel time versus highway capacity-miles 
per capita. 
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CONSISTENCY IN TRANSPORTATION DEMAND 
AND EVALUATION MODELS 
Dan G. Haney*, Peat, Marwick, Mitchell and Company 


This paper is addressed to the evaluation of traveler benefits associated 
with transportation system alternatives. Itis asserted that trip-generation, 
trip-distribution, modal-split, and traffic-assignment steps, which are 
carried out by individual mathematical models, in the transportation plan- 
ning process are frequently not consistent with one another, The incon- 
sistencies arise because, while each is intended to represent travelers’ 
behavior, its mathematical form can imply inconsistent behavior. It is 
further asserted that a formal evaluation of traveler benefits, which is 
normally the last step in the modeling process, may provide further incon- 
sistency. These inconsistencies may lead to erroneous conclusions re- 
garding the relative desirability of one system over another. An earlier 
paper on this subject dealt with achieving modal-split model and evaluation- 
model consistency. The current paper covers 2 additional aspects of de- 
mand modeling: the use of a trip-distribution model and the use of a 
total-demand model. Procedures for making consistent traveler-benefit 
calculations with each model are suggested. 


©THE PURPOSE of this paper is to attempt to provide a consistent and logical bridge 
between the methods used to conduct evaluations of alternative transportation systems 
and the methods used in other phases of the transportation planning process. 

An earlier paper (1) dealt with specific problems and inconsistencies that result in 
the evaluation of multimodal alternatives when the modal-split procedure is not closely 
coordinated with the evaluation procedure. That paper compared conventional methods 
of calculation of traveler benefits with recommended methods designed from different 
modal-split models. It was found that a suitably adapted consumers' surplus approach 
to benefit calculations provided correct results, whereas the more conventional traveler- 
expenditure methods were erroneous. A procedure for structuring the analysis method 
and for making the calculations was presented. 

The reader is referred to this paper for background and discussion of the theory be- 
hind the results derived here. The presentation in this paper assumes that the reader 
is familiar with the earlier paper. 

The present paper offers suggestions for tying evaluation methods more consistently 
to the earlier demand-modeling phases in the planning process. Separate sections of 
the paper deal with recommended procedures applicable to the following 2 approaches 
to demand estimation: 


1. The use of a trip-distribution model, in which different distributions of trips are 
estimated for different transportation system alternatives; and 

2. The use of a total-demand model, in which trip generation, trip distribution, and 
modal split are accomplished in a single model. 


Specific examples of each approach are described; however, the results are generally 
applicable to other formulations of demand, 


*Mr. Haney was with the Stanford Research Institute when this paper was developed. 
13 


14 


TRIP-DISTRIBUTION MODELS Figure 1. A simple network. 


As different transportation system alterna- 1 
tives are postulated and analyzed in a planning 
process, the most common method of dealing 
with demand is to make a single estimate of 
total trip-interchange volumes (a trip table) and 
to consider that demand as fixed over all alter- 
natives. This procedure has appeal; with de- 
mand fixed, a search can be undertaken to find 
the alternative that produces the given trans- 
portation service at minimum direct cost, and 
no thought need be given to whether different 
amounts of service are being provided. 

However, such a procedure would seem to 2 
violate logical inferences as to travel behavior. 

A reasonable hypothesis of travel behavior is 

that, as transportation impedances are selec- 3 
tively altered in a study area, persons will 

change their travel patterns. As particular 

areas (zones) become more accessible, there 

will be greater demand for travel to those areas. The analyst may wish to attempt to 
incorporate such change in travel behavior in the planning process. 

If the planning process has been designed along conventional lines, with trip gener- 
ation, trip distribution, and modal split being accomplished in 3 separate steps, the 
analyst may adopt a procedure of running the trip-distribution model separately for 
each transportation system alternative. If this is done under a fixed set of trip ends, 
redistribution will simply rearrange the trip patterns. The estimate will show some 
person trips lengthened along paths whose travel times or costs are reduced by a new 
alternative. Others will be shortened because the model compensates for the farther 
trips. 

As an example, consider a very simple network and zonal system consisting of 3 
links and 3 centroids as shown in Figure 1. Assume that the trip ends are to be held 
constant, as follows: O,; = D; = 570, Os = D2 = 540, and O; = Ds; = 420. Assume that the 
interzonal travel costs of alternative 0 are as follows: 


Destinations 


Origins Zonel Zone2 Zone 3 


Zone 1 6 12 15 
Zone 2 12 5 Li 
Zone 3 15 11 8 


If a simple gravity model with a cost exponent (cost being used in a general sense) of 
2.0 is used, the resulting travel volumes are as follows: 
Destinations 


Origins Zonel Zone2 Zone3_ Total 


Zone 1 420 82 68 570 
Zone 2 82 357 101 540 
Zone 3 _68 101 251 420 
Total 570 540 420 


Now, assume that the travel cost under alternative 1 between zone 1 and zone 3 is 
reduced by 2 units. The new cost matrix is as follows: 
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Destinations 


Origins Zonel Zone2 Zone 3 


Zone 1 6 12 13 
Zone 2 12 5 11 
Zone 3 13 11 8 


The resulting trip distribution, using the same gravity model, is as follows: 


Destinations 
Origins Zonei Zone2 Zone3 _ Total 
Zone 1 402 81 87 570 
Zone 2 81 365 94 540 
Zone 3 _87 _94 239 420 
Total 570 540 420 


Looking at the results in detail, we infer that the changes in behavior can be described 
in terms of 6 groups: 


1. Eighteen persons previously traveling internal to zone 1 now choose to travel to 
zone 3, 


2. One person previously traveling from zone 1 to zone 2 now chooses to travel to 
zone 3, 

3. One person previously traveling from zone 2 to zone 1 now chooses to travel 
within zone 2, 

4, Seven persons previously traveling from zone 2 to zone 3 now choose to travel 
within zone 2, 

5. Seven persons previously traveling from zone 3 to zone 2 now choose to travel 
from zone 3 to zone 1, and 

6. Twelve persons previously traveling within zone 3 now choose to travel to zone 1. 


The changes in behavior must be carefully understood. While 2 groups (groups 3 and 


4) are saving travel costs by their changes in behavior, the other 4 are incurring higher 
costs of travel. 


Conventional Benefit Calculation 


Unless that fact is recognized, a conventional approach of comparing travel costs 
for the 2 alternatives would simply multiply the travel cost times the travel volume for 
each zone pair and sum these products over all zone pairs. The most attractive alter- 
native, in terms of travel cost, would be the one with the lower cost. Alternative 1 is 
preferred if 


Dx Cty Dy > > Cty Diy 
ij ij 


Alternative 0 is preferred otherwise. The variables given above signify that Cf = travel 
cost between zones i and j for alternative k, and Ds; = travel volume between zones i 
and j for alternative k. In the example the total unit costs are 12,543 for alternative 0 
and 12,423 for alternative 1. Thus, alternative 1 would appear to be favored by 120 cost 


units. This net difference is the result of both some cost decreases and some cost 
increases. 


Improved Benefit Calculation 


Another way of viewing the changed behavior is to hypothesize that persons choose 
their travel destination in relation to 2 factors: the value of being at a destination and 
the cost of getting there. Looking at all possible destinations that are available, each 
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personassigns his particular value to each and assesses the travel cost. The destination 
that has the highest excess of value less cost will be chosen. 

For persons in 4 of the groups given above, the following perceptions should explain 
their changes in behavior: 


Group Alternative 0 Alternative 1 
1 value, - cost, > value, - cost, value, - cost, < value; - cost, 
2 value, - cost, > value; - cost, value, - cost, < value; - cost, 
5 value, - cost, > value, - cost, value, - cost, < value, - cost, 
6 value, - cost, > value, - cost, value, - cost, < value, - cost, 


where value, is the value of being at zone x, and cost, is the cost of getting to zone x. 
Because it can be argued that the value of an individual's being at a destination does not 
change, the benefits for persons in each of the 4 groups must be directly related to the 
change in travel cost to the new destinations. Some people in the group will have rather 
high preferences for their selected destinations, while others will feel less strongly. 
Therefore, depending on the improvement in travel, different amounts of travel will 
shift. The maximum individual traveler benefit will accrue to the traveler who was 
previously on the margin between the 2 destinations. As an example, the group 1 mar- 
ginal traveler will perceive benefits as follows: 


Benefit = (value, - 6) - (value, - 15) - (value, - 6) - (value, - 13) 


= 2 cost units 


At the other extreme, the minimum benefit will accrue to the traveler who is just 
barely induced to change destinations by the change in travel cost. His benefit is 
slightly greater than zero. 

The consumers' surplus concept can again be used to assess benefits. In this case 
we consider the zone pairs that have experienced a reduction in travel cost. Using the 
conventional demand curve diagram as shown in Figure 2, we have the following situa- 
tion for zone 1 to zone 3 travel: The benefits accruing to the travelers in groups 1 and 
2 are shown in the triangle labeled B. In addition, the travelers who traveled between 
zones 1 and 3 under both alternatives will perceive a benefit, represented by the rec- 
tangle A. Using the consumers' surplus formula, we find that the benefits for the per- 

sons traveling from zone 1 are 


Consumers' surplus = */2(Co - C,)(Vo + Vi) 

= ¥/,(2)(68 + 87) 
Figure 2. Travel demand between zone 1 and . 
= 155 cost units 


zone 3. 


A similar argument and calculation can be 
made for persons traveling from zone 3. 
The 2 calculations will account for groups 
1, 2, 5, and 6, plus the benefits to travelers 
who continue to travel between the same 
' zones as before. 
‘ss | kl But what about groups 3 and 4? They 
have changed their travel behavior, but 
neither the values of being at the before- 
and-~after locations nor the costs of getting 
to those locations have changed. In the nor- 
malizing process, the gravity model has 
forced them out of their preferred destina- 
tions to less preferred locations. We can, 
TRAVEL VOLUME however, observe that, for group 3, 


COST 


68 87 
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Value, - cost, > value, - cost, 
Value, - value, > cost, - cost, 
>12-5 
a | 
This group would be prepared to spend 7 additional cost units in order to travel to zone 
1 rather than zone 2, but, by the formulation and operation of the gravity model, they 


are forced to travel to destinations in zone 2 and thus must be incurring a disbenefit of 
at least 7 cost units. 


Disbenefit > cost, - cost, 


Not having information as to the actual values of being at the 2 destinations, we can 
understate the disbenefit by the calculation 

Disbenefit = cost, - cost, 

Disbenefit = 5- 12 =-7 


and, for group 4, 
Disbenefit = 5 - 11 = -6 


Suggested Procedure 


In summary, the total procedure for analyses of benefits and disbenefits is as 
follows: 


1. Determine which zone pairs would have reduced travel cost when compared with 
the base case. In the example, zone 1 to 3 travel and zone 3 to 1 travel would be 
identified. 

2. Identify the costs of travel and the travel volumes for those zone pairs. In the 
example, the following data would apply: 


Item Alternative 0 Alternative 1 
Zone 1 to 3 
Cost 15 13 
Volume 68 87 
Zone 3 to 1 
Cost 15 13 
Volume 68 87 


3. Make the consumers’ surplus calculation for each of the zone pairs given above. 
In the example, the following calculations would be made: 


Consumers' surplus for zone 1 to 3 = ‘A(15 - 13)(68 + 87) 
= 155 
Consumers' surplus for zone 3 to 1= 155 


4, Identify all other zone pairs that would have increased travel. In the example, 
zone 2 to zone 2 would have increased travel. 

5. Determine the travel cost and volumes for those zone pairs for the alternative 
being studied and for the base case. In the example, for zone 2 to 2 cost = 5, and vol- 
ume for alternative 0 = 357 and for alternative 1 = 365. 

6. Determine the number of trips that are reduced from the origins of the zone 
pairs given above to each destination and the original costs of the trips. In the example, 
the cost for zone 2 to 1 is 12 and for zone 2 to 3 is 11. The volumes are as follows: 
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Zone Alternative 0 Alternative 1 Difference 


2tol 82 81 1 
2 to 3 101 94 ar 


7. Calculate the estimated disbenefit for each change in volume. 


Disbenefit = (cost, - cost,)(volume change) 
or 
Benefit = (cost, - cost,)(volume change) 
In the example, the following calculation would be made: 
Benefit for zone 2 to 1 = (1 - 12)(1) =-7 
Benefit for zone 2 to 3 = (5 - 11)(7) = -42 


8. Add the results of steps 3 and 8 to obtain the total benefit. In the example, the 
total benefit is 


Consumers' surplus for zone 1to3 155 
Consumers' surplus for zone 3to1 155 


Benefit for zone 2 to 1 -7 
Benefit for zone 2 to 3 42 
Net benefit 261 


Such a benefit calculation produces an overestimate of benefits because the place- 
value losses of some travelers are not known. One can suppose that the overestimate 
is not large because the travelers who compete and win for the smaller number of trip 
destinations—in the example at zone destinations 3 and 1—probably have higher excess 
of values than those who compete and lose. It is the losers who are included in the 
disbenefit calculation. 

It is significant to compare the benefits by the 2 methods of calculation: 


Conventional method = 120 cost units 
Suggested method = 261 cost units 


If the theoretical approach of the suggested method is accepted, it appears to pre- 
sent significantly greater benefits than the conventional method. 

Another comparison is also of interest. If the analyst chooses not to redistribute 
travel for each new alternative and thereby to deal with a fixed trip table, only the 
travel from zone 1 to zone 3 and from zone 3 to zone 1 would be included. The net 
benefits here would be 2 cost units per traveler x 68 travelers x 2 zone pairs = 272 
cost units, which is much closer to the benefits calculated by the suggested method. 
However, it appears that not many significant observations can be drawn from the 
relative similarity of the values in this case. The methods are simply different from 
each other. However, the suggested method total of 261 cost units is composed of 272 
units of benefit to those who do not change destinations at zone 1 and 3, 38 units of 
consumers' surplus benefits to those who do change destinations [which is shown by 
the triangle labeled B, in Figure 3: (2)(74)(15 - 13)(87 - 68)], and 49 units of disbene- 
fit for those travelers from zone 2, 


Further Observations 


Following the procedure described above may not correct all of the inconsistencies 
between trip distribution and evaluation models. Most frequently, trips are distributed 
by using a friction factor relation or some other function of one variable, travel time. 
On the other hand, the evaluation model may use travel time differences multiplied by 
a value of travel time to estimate the time benefits and may also include other cost 
variables. One approach that might be taken to avoid this inconsistency would be to 
accomplish the trip-distribution process by using a number of combinations of time 
and cost variables, selecting the one that produces the best fit to the observed data, 
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and finding an equivalent price by manipulation of the equations. This price would then 
be used in a consumers' surplus formulation such as the one presented in the previous 
section. 

Also, even if trip distribution and evaluation were conducted consistently, a multi- 
modal planning problem may not be able to also resolve the modal-split and evaluation 
problem discussed in the earlier paper. Efforts to attain consistency in demand models 
are discussed next. 


TOTAL-DEMAND MODELS 


Within the past 5 years or so, a number of investigators, recognizing the incon- 
sistencies in the various demand models and the fundamental commonality of travelers' 
decision-making, have attempted to develop overall demand models for transportation 
planning. Those models place trip generation, trip distribution, and modal split into 
a single estimation process. Although the successful implementation of such approaches 
must solve a myriad of problems, it appears that future research may produce prom- 
ising results that will make the total-demand model more attractive than the currently 
popular sequence of models. 

As an example of the total-demand model, consider the one developed in the North- 
east Corridor Project (2, 3, 4). In the context of this paper, the model can be pre- 
sented as 


D = (Kar cP ryt + ache r7)6 


where 

D = total demand between 2 zones, in number of persons; 

K = variable representing a combination of economic, demographic, and travel 
characteristic variables plus a constant term, all of which do not change with 
changes in the transportation system; 

C: = cost of travel by transit; 

C. = cost of travel by automobile; 
Ti = time of travel by transit; and 
T, = time of travel by automobile. 


The variables a, 8, y, and 6 are constants determined in calibration. 
The demand for the transit mode is computed as follows: 


Dt = (D) [a.c? T7*/(achr?* + once 7)] 


and similarly for the automobile mode. 

After demand is estimated for each transportation system alternative by using a 
model such as that shown above, the planner needs to compare alternatives and to pro- 
vide information that can be used in selecting the one deemed most desirable. Among 
the comparisons usually made are comparisons of traveler benefits, in which travel 
cost, travel time, and other effects are assessed. 

A method of evaluating traveler benefits that is consistent with the travel behavior 
implied by the total-demand model is illustrated by considering 2 alternative trans- 
portation systems having costs and times as follows: 


Alternative 0 Alternative 1 


Cot Cit 
Coa Cre 
To t Ti t 
Toa Tia 


then 


D5. = (K) (we Coe" Tx + ace T72) a 
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and 
dD, = (K) (we cbt Ty: + Qs che pys)® 


The benefits that accrue to travelers between 2 alternative transportation systems 
can be found by using the concept of equivalent price (1). The equivalent price is, as 
defined in the earlier paper, the price that would have resulted in the same new demand 
if only the cost had changed. In other words, a demand D; for a new alternative might 
result from an improvement in both time and cost. The equivalent price would produce 
the same new demand but only with a cost (price) change. The equivalent price (if an 
improvement in the transit system is being analyzed) is found by solving the following 
equation for Cir: 


8 
D, = (K) (wi Ci? 12! + an chs 722) 


If rearranged, 
ch = {[./0)” - (aac %e)] avn? | 


In words, the demand on the new alternative is first found by using the demand equation 
and the values of cost and time for both modes under the new alternative; then, the 
equivalent price is found by using the values of the new demand and the old automobile 
time, automobile cost, and transit time. 

An alternative method of finding the equivalent price would be to use the modal- 
split formula. Here, Ci: is as follows: 


Ch, = ({[ (o:/Die) * 1] (cb Ta')} feed) 


The choice among such alternatives will depend on the mathematics. In some cases, 
one method may not be reducible to an analytical expression, whereas another may be. 

In the operation of the demand model, an improvement in the cost or time of 1 mode 
will cause not only a diversion of trips from the previous mode to the improved mode 
but also an increase in the total number of trips—which is referred to as induced traffic. 

The total traffic on the improved transit mode results from a combination of the 
original traffic, the diverted traffic, and the induced traffic. The diverted and induced 
traffic are as follows: 


Dit ~ Dot = Datvertea + Dinduced 
Dasverted = Doe as Die 
Dinaucea = (Dit - Dot) - (Doa - Die) 


The traveler-benefit calculations can be separated into the 3 groups described in the 
following subsections. 

Original Traffic—For the previous transit travelers who experienced reduced costs, 
the net benefits are easily calculated. 


NBeransst travelers = (Cot = Cit) (Dot) 


Induced Traffic—For the induced travel, arguments similar to those presented in 
the earlier paper can be made regarding the demand for travel, the willingness to pay, 
and the cost of travel. These arguments result in a consumers' surplus approach to 
demand estimation. 

Figure 3 shows a typical demand-curve relation between price and demand. Because 
the following arguments assume a transit system improvement, a transit-demand curve 
is shown. (Similar arguments regarding highway improvements could be pursued by 
using an automobile-demand curve.) 
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Those persons having a willingness to Figure 3. Transit-demand curve. 
pay greater thanthe cost Cp, are thetransit 
travelers under alternative 0 and are rep- 
resented by points on the demand curve to 
the leftof Dot. Those persons represented 
by points on the demand curve to the right 
of Dot have a willingness to pay for transit 
less than Co;. It is important to recognize 
that these persons really consist of 2 groups Cot 
because some are traveling by automobile 
and some arenot traveling at all under al- 
ternative 0. Thus, they are those that may 
be diverted to transit and those that may & 
be induced to travel (by transit), ae 

If improvement alternative 1 were in- 
stalled, at an equivalent price of Cis, the 
total demand for transit would increase to 
Dit. The increase is made up of members 
of each of the 2 groups identified above. 
Even a very small reduction in cost would 
result in an increase in travel traceable to D 
both groups. Therefore, members of both # 
groups lie at all points along the demand DEMAND FOR TRANSIT 
curve. This is an important characteristic 
of the model. 

The group of persons who would be in- 
duced totravel would, in theory, have con- 
sidered both automobile and transit under alternative 0 and decided that they were not 
willing to pay either price. If transit improves and highways do not, such as in alter- 
native 1, the crucial choice of this group is between transit and not traveling. 

Suppose a tiny fraction of the latter group is at the margin under the original con- 
ditions. Because they are at the margin, they perceive no difference in benefit be- 
tween using transit or not traveling. However, if the transit price is reduced, they 
would choose to travel by transit, and their benefit is indicated by the difference be- 
tween their willingness to pay and the price of transit. For the assumed improvement, 
this difference is Co, - Cit. Another tiny fraction of the latter group will be at the 
margin between travel by transit and not traveling if the transit price is Ct. They 
lie at the Di point on the curve, and if the improved transit system were installed, 
the difference between their willingness to pay and the price is Ci - Cit. Similar rea- 
soning can be applied to each traveler regardless of where he is represented on the 
demand curve. 

For all of the persons who would choose to use transit rather than not traveling, 
the total benefits can be derived. If the induced travelers are divided into small m- 
increments depending on their location on the demand curve, the net benefits could be 
estimated by summing the benefits that accrue to each increment as follows: 


COST (EQUIVALENT PRICE) 
Q 


Dy Dit 


m 
NB; = i” (Ciey = Cit) n; 


j=l 
where 
NB; = net benefits from the induced travel for alternative i, 
Cit) = average equivalent cost at a point on the demand curve representing the jth 
increment, 
Ci, = equivalent price of alternative 1, and 
n, = number of induced travelers in the jth increment. 


If the procedure described in the earlier paper is followed, the sizes of the increments 
could be reduced and the number of increments increased until their number approached 
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infinity. Then, methods of calculus could be used to produce a more exact calculation. 
However, these net benefits can be approximated, following arguments similar to those 
presented in the earlier paper, by a consumer's surplus formulation. 


NBinducea = % [Cot - Cit) [(Dit - Dot) - (Don - D;.)] 


This formulation depends on the earlier recognition that members of both groups lie at 
all points along the demand curve. 

Diverted Traffic—For the diverted travelers, similar arguments on the willingness 
to pay can be made; these travelers also consider automobile travel, transit travel, 
and no travel. Under the alternative 0 conditions, the members of the group lying 
along the demand curve between Do; and Dy: choose to travel by automobile. As transit 
improves from Co: to Cit, their crucial comparison is between automobile travel and 
transit travel. 

Under alternative 0, a tiny group of those travelers who lie at Do; on the demand 
curve are indifferent between transit and automobile. They are willing to pay Co, to 
use transit and no more. Similarly, they are willing to pay Co. to use automobile and 
no more, However, if the transit price were reduced, they would choose to travel by 
transit. Their benefit would be indicated by the difference between their willingness 
to pay for transit and the cost of transit. For the assumed improvement, this differ- 
ence is Co: - C%:. Another tiny fraction of previous automobile travelers will choose 
to use transit only if the transit price is less than Cf. They lie at the D{ point on the 
curve shown in Figure 3; and, if the alternative 1 transit systems were installed, the 
difference between their willingness to pay and the cost is their perceived benefit, 

Ct ” Ch. 

Thus, the total benefits can be derived for the group of persons who would choose 
to use transit under alternative 1, those being diverted from automobile. The argument 
is similar to that presented earlier, which displayed summation of benefits over a num- 
ber of traveler increments for the induced travel. Now, the m-increments are incre- 
ments of travelers in the diverted category. 

The result is that the net benefits to the diverted traffic can also be approximated 
by a consumer's surplus formula. 


NBatverted = Th (Cot - Cit) (Dos - Die) 


Total Benefits 


The total net perceived traveler benefits for the improvement in the transit system 
is the sum of the net benefits to the 3 types of travelers. 


NB = NB siiernat transit traffic + NBayvertea tratele + NB gnauced trarfic 
= (Cot - Cit)(Dot) + %e(Cot - Cit)(Doa - Diz) 
+ %e (Cot - Cit)[(Dit - Dot) - (Doa - Died] 


which simplified to the well-known consumers' surplus formula is 
NB = “2 (Cot - Cit) (Die + Dot) 


Suggested Procedure 


The procedure that has been followed in this example to calculate the traveler bene- 
fits can be applied to any transportation system in which a total-demand model is used. 
The following steps should be followed to compute net traveler benefits. The benefits 
should be computed for each zone pair. 


j. Identify those zone pairs that, compared with the base case, have increased 
traffic under the alternative being studied. 
2. Compute the equivalent price C° for each zone pair. 
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3. Compute the consumer's surplus for each zone pair. The expression 
NB = “4 (Cot - Cit) (Dit + Dot) 


would be used in the case of transit improvement. 
4. Sum the net benefits over all improved zone pairs. 


It has been recognized that one difficulty with the Northeast Corridor demand model 
is that it tends to reflect improvements in a given mode in much larger quantities of 
induced traffic and much smaller quantities of diverted traffic than are reasonable. 
Various steps have been taken to correct this problem. 

Although this problem is very real and must be dealt with when the needed capacity 
on the various modes is considered, the resulting consumers’ surplus formulation given 
above mitigates the problem somewhat. The formulation does not require separation 
of the 2 types of increase in travel—induced and diverted. In other words, the individ- 
ual benefits to the travelers who elect to travel by the improved mode is the same re- 
gardless of whether they are induced or diverted travelers. 


A NOTE ON THE INTERPRETATION OF THE ALPHA TERMS 


In the total-demand and modal-split formulations presented in this and the earlier 
paper, the term a is used as a constant term in an expression describing either a dif- 
ference in utility between modes or the utility of an individual mode. 

Such models should normally be developed by experimentation with a number of 
formulations of the various independent variables in order to arrive at an expression 
that most nearly explains the observed behavior. Regardless of the amount of experi- 
mentation or the number of variables included, an unexplained difference invariably 
remains, This difference is represented by the w terms used in the formulations given 
in this and the earlier paper. Some investigators have referred to this unexplained 
difference as a comfort and convenience factor. The author would prefer to refer to 
it as the total unexplained difference, without theorizing any particular name or cause. 

This difference is a difference between explicit modes of travel, between the modes 
against which travel decisions have been studied and the model developed. The explicit 
model should only be used to evaluate changes in the modes that were used in the cali- 
bration process. 

We submit that there is no such thing as an "'abstract modal model," in the sense 
that any technological mode can be studied in the context of a demand or modal-split 
model calibrated for 2 explicit modes. The new technological mode would produce a 
different a term that would stand for the preferences for that mode that are not ex- 
plained by the independent variables, (Similarly, the significance of the independent 
variables under a new mode assumption may be different from that under the condition 
that existed for calibration of the model.) 

This observation implies not only that considerable care should be used in applying 
demand and modal-split models to new modes but also that care should be used in con- 
ducting evaluations using these models. 
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DISCUSSION 
Ezra Hauer, Department of Civil Engineering, University of Toronto 


Haney points out the fallacy of judging the efficiency of a transport system on the 
basis of aggregate cost incurred in travel. The minimization of total cost is a reason- 
able objective when different systems achieve the same utility. But, because alterna- 
tive transport systems lead to different origin and destination linkages (and level of 
trip generation), the condition of equal utility obviously is not fulfilled. Specifically, 
the paper argues that changes in the transportation system induce some travelers to 
reexamine their selections of origins or destinations. For those travelers, before- 
and-after cost comparisons are meaningless because not only did the cost of the trip 
change but so did its utility. 

The logic of the argument presented is sound. Yet, the reader is left with the un- 
comfortable sensation that its practical consequences (increased net benefits to be used 
in justification of transport investment) point in the wrong direction. A stone is being 
added to the benefit side of the scales while society's finger tries to push the pointer 
in the opposite direction. To repeat a familiar argument: Transport investment leads 
to ease of travel, which is conducive to a footloose selection of place of residence and 
work, which in turn generates an apparent dependence of the society on travel and 
makes investment in transport system improvements easy to justify, and so on ad 
infinitum. The outcome of this self-perpetuating process is the present urban struc- 
ture; its diffuse activities render automobile dependence absoiute and the concept of 
"choice through mobility" questionable. For some years now, several communities 
dared to question the wisdom and expertise of planners and opposed their recommenda- 
tions for transport investment. The dilemma is obvious: How is it possible that in- 
vestments, which are justified on the basis of values and preferences of all members 
of a community (as calculated by the planner), are frequently opposed by vigorous 
political action of the very same group? 

The paradox may be easily explained in terms of vocal minorities, professional 

activists, irrationality and misinformation, uneven incidence of costs and benefits be- 
tween groups, imperfections of the political process, or myopic decision-makers. 
The planners, however, must seriously consider whether the paradox stems, at least 
in part, from a professional bias. Specifically, has the planner not been systematic- 
ally more diligent in searching for benefits than in scouting for "costs?" It is on this 
basis that Haney's paper may be found wanting. 

Whether justification of public investment should proceed on the basis of the elusive 
"eonsumer surplus" while investment in the private sector can rarely do so is a moot 
question. But even if the legitimacy of incorporating the consumer's surplus into the 
benefits of investment is not questioned, the suggested evaluation scheme lacks in com- 
prehensiveness. 

It is well known that only a part of the cost associated with the performance of a 
trip is borne by the trip-maker himself. Some of the cost is imparted to his fellow 
travelers in the form of increased congestion, safety hazards, andthe like. This com- 
ponent of the cost should not remain unaccounted for. Its neglect is particularly ob- 
jectionable when, as a result of investment, new trips are being made. In this situa- 
tion, it is the planner who is responsible for the incorporation of costs that the induced 
traveler is incapable of perceiving. The "congestion cost" is not merely academic 
hairsplitting relating to hypothetical situations. It may be seen at work in the common 
example of transport and other investment in the outer reaches of the cities, inducing 
new trip-makers into commuting to the downtown. Although the new travelers indicate 
by their decision to travel the receipt of a net benefit, the added plight of the original 
users remains anonymous and is not added to the accounting ledger. It is, however, 
present in a growing body of public sentiment that questions the desirability of "growth" 
because it usually means deterioration for those already in the system. 

The second component of transport cost unaccounted for in Haney's accounting pro- 
cedure is the cost imparted by trip-makers to nonusers of the transport system. Those 
are simply not present in the model. Yet, almost every major transportation invest- 
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ment in the past decade had to contend with relocation, air pollution, noise, visual 
obstruction, community disruption, and other impacts on nonusers. 

In summary, Haney's method of calculating traveler benefits is certainly rational 
and consistent with currently used models. Lest consistency be construed to mean 
comprehensiveness, I find it necessary to point out that the benefits and costs accounted 
for in this procedure form but a part of the overall impact of transport investment. 
Under no circumstances can it be regarded as a "'total procedure for analyses of bene- 
fits and disbenefits."" And if used so, it will in all likelihood result in yet another con- 
frontation of the public versus the “highway establishment." 


AUTHOR'S CLOSURE 


Hauer's principal concern is that transportation planners may become enamored 
with advancing the state of the art in the estimation of benefits from investments in 
transportation systems at the expense of advancing the art in estimating costs, as they 
are broadly defined. With this point I certainly agree, and I would hope that my earlier 
writings would testify to that philosophy. 

The intent in the paper, perhaps more explicitly stated in the earlier paper (1), was 
to deal only with traveler benefits and costs. To treat isolated problems in methodology 
is appropriate within the format of professional papers, as I am sure Hauer will agree. 

With regard to specific points in his comment, Hauer argues that a biased effort to 
find benefits can lead to diffused urban structure. Perhaps this is so when highway 
planners justify new highway projects. But what about transit planners? Do they search 
to find benefits so that they can counteract the urban-sprawl effect? I think not. I 
would hope that both the highway planner and the transit planner would search for 2 
kinds of optimums: the optimal transportation system and the optimal effect of trans- 
portation in influencing land use. Neither of these is easy to define and measure, but 
I would hope that the procedures presented in this and the earlier paper would lead to 
improved solutions to the problem of finding optimal (balanced) transportation systems. 
At the present time, I would prefer not to argue as to the most attractive land use 
pattern, although a subsequent paper will address itself to a facet of that problem. 
Suffice it to say that the procedures described in the 2 papers could—or might—provide 
the planner with an improved way of assessing the potential of transit systems to con- 
dense patterns of land use as well as of highway systems to diffuse the patterns, Per- 
sonally, I do not think that the effects of different technology cannot exclusively be 
labeled as producing, condensing, or diffusing land use changes. 

Regarding Hauer's discussion of disbenefits to existing travelers, if both the demand 
and supply curves are specified and if the supply curve produces increased cost with 
increased usage, i.e., congestion cost, it is possible to estimate the magnitude of the 
increased cost to the original users. It can be estimated as the difference between the 
actual cost to the criginal users (as well as to the induced users) for the new system, 
less the cost to the original users for the new system had the induced travel not mate- 
rialized. Thus, the disbenefits to original users can be added to the accounting ledger. 


FORECASTING TRAVEL DEMAND 
FOR NEW URBAN TRANSPORTATION SYSTEMS 


Anthony R. Tomazinis, University of Pennsylvania 


The paper presents the experience gained and the conclusions reached 
by a University of Pennsylvania team who carried out the forecasting pro- 
cedures for the minicar transit system as a part of a study by a larger 
group at the university. The technical problems in determining the sys- 
tem characteristics and the levels of service, in defining the travel needs 
of the study areas, and in developing and applying a method of travel- 
demand forecasting for a novice system are discussed in the light of ex- 
perience gained in the minicar transit project. The new system was 
studied for the Philadelphia CBD and the low-income districts that sur- 
round the CBD on 3 sides. Major methodological dilemmas and limitations 
of market analysis tools are also discussed. Forecasts were made for 
manifest travel demand and latent travel needs. 


®THIS PAPER presents the conclusions reached from a case study of the problem of 
forecasting demand estimates for new, untested urban transportation systems destined 
for use within an uncontrolled, competitive, and continuously changing urban environ- 
ment. The system under consideration was one based on small vehicles available for 
rent by the trip, the hour, the day, or long periods of time by any qualified driver. 
The vehicles were called minicars, and the system was called minicar transit system 
or MTS (1, 2, 3, 4). The effort was carried out by researchers of the University of 
Pennsylvania as part of a larger project that included the development of the vehicle 
itself, the MTS, and the determination of the feasibility and desirability of the vehicle 
and the system as a whole. The project was financed by the Urban Mass Transporta- 
tion Administration as part of its program in developing new systems. 


THE PROBLEM 


The technical problem in this aspect of the project was primarily based on estimat- 
ing with satisfactory reliability the extent to which the new minicar system will be uti- 
lized by its prospective users. This simple definition of the problem is sufficient to 
indicate a number of particulars of the overall problem. The transportation planner 
was asked not only to forecast the potential demand for a new system but also to sug- 
gest what would be the location of the most desirable test and application of the new sys- 
tem. In the case of the minicar project, the efforts focused on testing the feasibility 
and desirability of such a new system in 2 parts of the Philadelphia region: the Phil- 
adelphia central business district and the low-income ring that surrounds the CBD on 
3 sides. The 2 areas are to be taken together or separately. 

A second major problem faced from the outset was the determination of both the 
technological system characteristics and the definition of the services that the system 
would provide for the study areas. It is of particular interest to note that, in develop- 
ing a new transportation system and in testing the desirability and feasibility of such 
systems in the early stages of the effort, it is necessary for the transportation planner 
to undertake the responsibility of suggesting both the desirable technological character- 
istics of the new system and the type and pattern of services that the new system may 
be called on to offer. This was also true in the case of the minicar project. 

These and several other problems form the core of any effort to forecast travel de- 
mand for new systems. The steps that ought to be followed were complicated and re- 
quired a sequence that was supported by logic and was consistent with the availability 
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of the data sets that were necessary for the completion of the project. Figure 1 shows 
the various steps and the sequence of actions that were developed and applied in the case 
of the minicar project in response to these methodological requirements. 

What follows in this paper is a discussion of the various phases of analysis and the 
conclusions reached in performing the various tasks. 


PROBLEMS OF AREA SELECTION AND LEVEL OF AREA ANALYSIS 


Several problems of particular significance arise as soon as the area of the potential 
application of the new system is considered. One must immediately refer to the central 
goals of the entire effort and the initially postulated capabilities of the new system. For 
instance, in our minicar project the central goals of the effort were the alleviation of 
street traffic congestion and the provision of an additional means of transport to those 
urban residents who did not hiterto have a satisfactory choice of travel mode. Also, 
from the outset, the new mode of travel was defined as a means of serving short-to- 
median-length urban trips of those who can drive and of providing the service data cost 
somewhat below that of the private automobile and taxi and somewhat above that of the 
public transit system. In response to these 2 essential determinations—the central 
goals and the essential characteristics of the new system—the research team selected 
2 areas for testing the initial system: the Philadelphia CBD where all types of urban 
traffic congestion occur in their worst form and the low-income districts that surround 
the Philadelphia CBD on 3 sides. The choice of the test areas or study areas was the 
start of an extensive analytical process to determine 4 essential aspects of each area, 
as follows. 


1. Determination of the boundaries of each area proved to be critical in our effort 
in both areas. At least 2 major considerations enter here: One is the intrinsic charac- 
teristics of the area, and the other is the functional requirements of the system. With 
regard to the first consideration, it is frequently clear that neither does the CBD of a 
region have a demarcation line that separates it from the next block of activities nor do 
the low-income areas have a distinct demarcation point (physical or economic) that sep- 
arates them from areas with slightly higher income levels. With regard to the second 
consideration, the requirements of a "systems testing"’ imposed 2 additional actions. 
First, a small "island" of high-income households in the midst of a low-income district 
or a park area in the midst of the CBD would have to be included in the overall outline 
of the study area. Second, the core study area would have to be related and function- 
ally associated with the surrounding area with which the "study area" interchanges the 
vast majority of its trips. Thus, 2 study areas become apparent: the core study area 
(CSA) with which we were primarily concerned and the regional study area (RSA) within 
which the CSA is located and with which the CSA constantly interacts. 

2. A second aspect of the problem was a conflict between our project objectives and 
the availability of data for each component of the study area. It soon became clear that 
restrictions on the outline, size, and even location of the study area ought to be accepted 
because of data requirements. 

3. The third aspect of the problem that rapidly emerged was the appropriate level of 
analysis of the conditions prevailing in the study for the purposes of the project. As the 
analysis of the conditions prevailing in the study areas and of their travel needs was 
carried out, it became necessary to frequently impose severe restrictions on the in- 
dividual analysis in order to avoid aimless analytical ventures that might be of interest 
but that had very little relation to the objectives of the project. 

4, The determination of indexes of service deficiency was the fourth major aspect 
of the problem. This requirement is indeed of particular significance and is especially 
complex. It involves both the physical and the operational characteristics of the present 
and planned systems as well as an essential determination of ''standards'' and service 
levels that are either desirable or feasible or both. It seems that an incomplete, short- 
sighted, or utopian determination of standards or service levels may prejudge the out- 
come or even the evolution of the entire test of the new system. In our case we found 
particular difficulties in establishing widely acceptable and quantifiable concepts of 
latent travel demand for the low-income areas and in establishing satisfactory differen- 
tiations between public and private concepts of requirements for service in both core 
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Figure 1. Forecasting procedure of minicar project. 
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Table 1. Minicar system setups for low-income areas. 


Maximum 
Automobile Fee on 
Minicar Terminal Parking Round 
Fare Value of Time Spacing Cost Trip 
Setup (cents/mile) (dollars/hour) (mile) (cents) (dollars) 
I 20 1, 2, and 3 1 = 2.00 
I 20 1, 2, and 3 hs 10 2,00 
m 15 1, 2, and 3 ‘A 10 1.50 
Iv 10 1, 2, and 3 hs 25 1.00 
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study areas. Even the determination of the "travel needs" of the study areas proved to 
be particularly complicated when the introduction of a totally new system with smaller 
and different use limitations was taken into account. 


SYSTEM DETERMINATION FOR TESTING 


The system determination for testing involved 3 components. One was the outline 
and the "firming up" of the physical and technological characteristics of the system. 
In addition to the means of costing a trip, such characteristics in our case proved to be 
the vehicles’ operating speed, capacity, maneuverability, driving requirements (i.e., 
licensed drivers required, physical availability in parking lots and garages (a minimum 
delay in picking up and delivering vehicles), and driving reliability. The second com- 
ponent was an early determination of the unit cost of travel in a manner that included 
all pertinent costs such as vehicle costs, costs of parking and storing, costs of account- 
ing and billing, costs of repairs, accident costs, and all other overhead costs of the 
system. The third component of system determination for testing was the early areal 
distribution of the system associated with an early determination of the types of services 
(trips) the new system will be able, appropriate, and available to compete for. 

All 3 components of system determination for testing proved difficult to define in 
our project. In general the more novice the system is and the more alternatives it 
makes possible, the greater is the difficulty in firming up a total system for testing 
purposes. In the case of anticipating adoption of a new system within the framework of 
a free urban society and in an open competition with all the other systems, the deter- 
mination of the new system for testing purposes is as critical as any other aspect of the 
overall effort. The transportation planner can right there "kill" the project or, on the 
other extreme, delude himself into believing the superiority of the idea he is testing. 


GEARING THE SYSTEM FOR TESTS 


At the completion of the 2 first phases (system determination and study area selec- 
tion and analysis), the stage was set for gearing the system for comparative simula- 
tion tests. The work in this phase could most reasonably start from a determination of 
what and how many combinations of systems and services will be tested. In our case 
we felt it necessary to test at least 2 alternative setups for the CBD and 4 setups for 
the low-income areas. Table 1 gives the setups for the low-income areas. Several ele- 
ments can be varied at this early stage in the search for various definitions of optimum 
system. In the minicar system considerable variation was available in setting the fare 
levels, in determining the maximum cost of a trip, in assuming different values of 
time for different groups of users, in determining the frequency of system terminals 
and hence the level of availability of minicars, and in assuming parking cost for the 
private automobile. Additional items could be varied but the ones already included 
presented enough complications to restrict us from expanding the list. 

The notions of optimum system were also quite variable and only to a limited extent 
subject to analytical methods of optimum determination. This is so because in only a 
few cases would the new systems be called on to simply maximize their revenue or to 
simply produce most efficient services. For most cases, the type of service offered, 
the group of beneficiaries, and the impact on other competitor or supplementary systems 
are of equal significance in current testings of new systems. In fact, concepts of this 
broader type have in the recent past cast unfavorable light on many new systems that 
otherwise would appear desirable and ready for development. 

The determination of distinct combinations of services and facilities to be tested 
leads to the development of system variables that would, from then on, represent the 
system. This aspect of the undertaking produces a welcome clarification in the minds 
of the analysts with regard to what the new system can offer. The system variables 
express in essence the operating characteristics of the system and represent the dimen- 
sions along which the new system can be measured and compared both with expressed 
travel needs of the study area and with the capabilities of the competitive systems. 

For our minicar project, we found the need to carry investigations that measure out-of- 
pocket requirements, running time requirements, and excess time and cost requirements 
for the completion of the trip. Unfortunately, no measures could be carried out with 
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regard to the system availability to all its potential customers or with regard to com- 
fort, convenience, safety, and reliability of the new systems. Thus, we had to assume 
that these systems variables would not be any different from the ones characterizing 
the competitive systems and, therefore, would not produce any influence anyway. 

The measurement of system variables involves both the new system and all the 
other systems currently in use or contemplated to be in use by the time the new 
system will be in place. The actual mechanics, however, involve both abstract or aver- 
aged measurements and specific measurement of interchanges for which more than 
one system will be available. Thus, it is an important step in the whole process to 
determine the extent to which detailed measurements will be carried out. There are 
2 approaches at this point. The analyst can either produce a blanket measurement of 
system variables for all probable interchanges or select only a set of representative 
interchanges for which detailed measurements can be made. The choice depends on 
the availability of funds and time. The first approach is all inclusive and presents a 
plethora of measurement points. In our minicar project, we selected the second ap- 
proach primarily because of limitation in money and time. However, we included 
enough representative measures to cover the spectrum of probable trip lengths and sets 
of circumstances that may influence measurements in the new system or the competitor 
system. 


SELECTING FORECASTING PROCEDURES 


The entire effort is brought to its most critical point at the moment the choice for 
a specific forecasting procedure is made. There are several approaches in forecast- 
ing travel demand for a totally new system. In general they can be classified into 2 
groups: the manifest preference method and the expected preference method or alter- 
natively the travel needs simulation method and the market analysis method. Both ap- 
proaches include a number of variations andare subject to ad hoc combinations of methods. 
(The methods and techniques of the abstract-modes models can easily be seen as fall- 
ing within the manifest demand classification because they clearly utilize a manifest 
data base (5, 6). 1 In general the market analysis approach would proceed with one of 
the many possible forms of questionnaires on the basis of which the potential consumer 
will be asked to express ana priori judgment and preference about what he would consider 
preferable and what he would do in case the new system becomes available to him. The 
exercise is accompanied with a brief description of the new system and with a request 
to express a choice, all other things being equal. One of the most sophisticated ques- 
tionnaires in this group is the one that is usually called a discriminant questionnaire in 
which the potential customer is asked to express an incremental like and dislike for the 
various characteristics of the new system as part of a technique usually referred to as 
semantic differentials. Variations of this method are used by most major manufactur- 
ing companies (the automobile industry included) in testing what their new products will 
encounter in the market or how their new products must be improved (7). 

The market analysis approach is based on market segmentation from the outset, and 
in many cases it is reinforced with a product clinic. The term "product clinic" is given 
to a special effort, usually carried on annually by the automobile companies, in which 
the new product is exhibited and explained to successive segments of the market. At 
the end of each routine, the potential consumers are usually asked to complete a ques- 
tionnaire indicating their feelings, preferences, and choices on the basis of what they 
know, what they have seen, and what they have been told about the new product. 

In contrast to these market analysis techniques, the analyst has available for his 
use considerable data that indicate what the potential consumers have already done in 
cases where a choice was involved. The approach is based on manifest behavior and 
preferences, consciously or unconsciously made. If, then, the analyst succeeds in re- 
lating the manifest choice with the circumstances within which the choice was made and 
with the relative characteristics of the competing systems, he can assume that in similar 
cases in the future the potential consumer will act similarly. This approach is the 
essence of modal-split analysis and projection carried out by most urban transportation 
studies. In doing so, the analyst has available several methods that he may employ 
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such as correlation analysis, diversion curves, abstract-modes models, and stochastic 
choice models. 

As the minicar system forecasts were carried out, the dilemma of the approach to 
be followed was felt in its full force. The new system was sufficiently novice in its 
physical outline and operating characteristics to warrant market analysis experimenta- 
tions and product clinic findings. The new system was, however, sufficiently similar 
to present systems to render the findings of other modal-split studies very useful in 
forecasting usage. After all, cost variations and travel time variations have been in 
the heart of modal-split analyses of most major modal-split studies. 

As a result of these realizations, the team proceeded by utilizing both approaches 
but placing major emphasis on the manifest preferences method. A number of market 
analysis studies were made for most segments of the potential market. In addition, a 
product clinic was held in a special room of the university for about a month to which 
more than 100 groups were invited in sequence. The market analysis studies were 
small in size and attempted to measure probable future preferences in specific cases. 
The product clinic participants were given the opportunity to inspect and experience the 
minicar itself, review the results of the first phase of the study, listen to additional 
explanations, and then carry on a conversation with key members of the research team. 
The invited groups varied from leadership groups such as councilmen and civic leaders 
to small street-corner groups and informal associations. At the end of each session 
each participant was asked to complete a questionnaire. 

The main forecasting effort proceeded, however, by utilizing data and conclusions 
based on the 1960 home interview survey of the Penn-Jersey Transportation Study. 

The target date was 1975 for which a preliminary regional transportation plan and pro- 
jection was made by the Delaware Valley Regional Planning Commission in the period 
1963-66. 

The application of a modal split approach in determining the minicar share of the 
manifest trips involves still many steps and assumptions that must be carried out before 
the process is completed. The first major commitment to be carried out was the exact 
method to be utilized in estimating the share of the minicar trips, among the methods 
available, the conclusion was in favor of developing a set of synthetic diversion curves. 
The correlation analysis was excluded because of both the need it has for empirical 
data in deriving the coefficients of the equations and the undue strictness of the rela- 
tions that the equations convey. The stochastic approach was also rejected from the 
outset because of the undue articulation of the hypothesis and relations that the approach 
requires and that in no way could be met in this speculative and experimental study. 

The set of synthetic diversion curves was developed by using the experience gained 
in the recent past in developing modal-split diversion curves in actual situations and 
by utilizing also a set of logical-commital statements that establish the 2 extreme points 
of each curve as well as its midpoint. Then by assuming the applicability of the economic 
principle of diminishing marginal returns, the team formulated a set of diversion curves 
approximating the shape of diversion curves of the recent past. Figure 2 shows the re- 
sultant curves for 2 sets of choices. These curves were then utilized in association 
with a composite system variable developed with the total travel cost of each interchange 
under consideration. 

The total travel cost variable was formed as the key variable for the diversion curves, 
against which the share of minicar travel could be estimated. The variable represented 
the summary cost of all out-of-pocket costs, the time cost, and an approach time cost at 
the 2 ends of the trip. The minicar fare was explicitly counted on the basis of the as- 
sumptions advanced in each system setup for testing. The automobile costs were in- 
clusive of operating costs and fixed costs of depreciation and insurance. Varying 
automobile parking costs were added. For transit and taxi trips, the fares of each 
trip were explicitly included. The total costs of each system were then made to form 
ratios that were further investigated with respect to trip length, value of time, level of 
parking costs, and amount of approach costs for each trip. Figure 3 shows some of the 
variations of the cost ratios with trip length and value of time for minicar versus auto- 
mobile and minicar versus transit. 

On the basis of this set of assumptions, presumptions, and partial calculations, the 
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Figure 2. Synthetic diversion curves for low-income areas. 
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forecasts for the share of minicar trips from the total manifest trip projection of the 
residents in the region were made for 1985, The essential question that the team was 
attempting to answer was, What would happen to the trip patterns by mode if, in addi- 
tion to what exists and is planned in the mean time, a minicar transit system were in 
operation in the study areas in 1975? The first part of the answer was to be derived 
through these sequential and somewhat complicated steps. However, in addition to 
what one can expect to occur in shifting trips from one mode of travel to the other, 
there is a serious reason to expect that additional trips will be made in 1975 if the 
new system is in operation, because of the existence of the new system. What is in- 
troduced here is the possibility of realizable latent travel demand due to the character- 
istics of the new system. The essence of latent travel demand is that a real need anda 
desire to make a trip are surpressed because of the limitations imposed by the system 
on which the trip must be made. Thus, when a bus line stops operating after 10 p.m. 
transit trips that would be made at that time have to be postponed or altogether canceled 
if the trip-maker cannot make use of an automobile. The cost to the individual of the 
canceled or postponed trip cannot be easily conveyed but is nonetheless real. Environ- 
mental limitations or constraints are also equally frequent and real. Trips must be 
postponed, combined, or outright canceled if the trip-maker cannot bear the direct 
cost of the trip in money or time, regardless of any indirect benefits to the potential 
trip-maker. Thus, the need and desire for these trips constitute the latent travel de- 
mand for the given set of users and for the given set of circumstances. When the new 
system is introduced with its essentially different levels of availability, low cost, and 
high speeds, it is bound to make a proportion of the existing latent travel demand 
realizable and thus manifest for 1975. This is what other researchers some years ago 
called induced trips, i.e., trips that the new expressway systems were found to produce. 
The methods of calculating latent travel demand are still in their infancy. They are 
usually based on measuring the elasticity of a demand function with respect to either 
the income of the consumer or the price of the commodity. A similar approach was 
followed in the minicar project (although the team avoided producing a mathematical 
solution by taking the partial derivative of the demand function with respect to the unit 
cost of the minicar trips). In general, the procedure followed was to "estimate"! the 
elasticity of demand due to the price changes within the guidance we found in specifying 
the shift of trips from one mode of travel to the other due to price changes for manifest 
travel patterns. The forecast total travel demand was the summary of the 2 partial 
demand estimates, i.e., the manifest and latent travel demand of the study area. 


SENSITIVITY OF DEMAND FORECASTS 


The long sequence of steps required to produce a forecast for travel demand of 
a new urban system plus the extensive number of assumptions and presumptions that 
the analyst has to undertake in order to complete the missing pieces of information 
necessary to his work clearly indicate that at the end of the process he possesses an 
estimate with considerable limitation. This realization, together with the fact that a 
limited number of system setups tested represented only a small part of what can be 
devised and tried in the future, imposed a clear need to investigate and specify as clearly 
as possible the sensitivity of the forecast estimates to the several important system 
variables utilized in the process. This was exactly the concluding part of the minicar 
project forecasting effort. 

Usually there are 2 types of variables: the variables that are endogenous to the new 
system (which are also under considerable design control) and the variables that are 
exogenous to the new system per se but endogenous to the study area. In the minicar 
project the endogenous variables were the minicar fare structure, the spacing of the 
terminals, and the conditions and delays of the vehicle pickup and delivery. Exogenous 
variables were the value of time, the automobile parking cost and availability, and the 
operating costs of automobiles and the fares of transit and taxis. The sensitivity analy- 
sis carried as part of this project imdicated that the forecast estimates were extremely 
sensitive to the endogenous and exogenous variables. Further, the type of trips attached 
to and generated by the new system proved also to be very sensitive to variations of 
many of the endogenous and exogenous variables. In fact, our analysis indicated that 
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the flexible variables tended to be more important than the plain technological charac- 
teristics of the new system per se. 

This sensitivity to system variables was considered to constitute both an important 
risk in introducing the new system and an important policy tool in influencing the suc- 
cess of the new system and directing the impact it may have on automobile trips, tran- 
sit trips, and taxi services. In essence, extensive knowledge of what influences the 
new system would have, and of the way the various influences would be felt, was found 
to be an important element in managing such a system and in interweaving it within the 
fabric of other urban transportation systems and services. 


CONCLUSIONS 


The conclusions of this case study, above and beyond the specific findings of the 
minicar project, led the research team into a series of reappraisals of the entire 
effort. In many respects, the team realized time and again that the available methods 
and techniques in the field are indeed of limited ability in forecasting travel demands 
for new systems for use within modern urban areas. The need for relevant empirical 
findings was indeed pervasive. On occasion it seemed that nothing could make up for 
lack of data from direct experiment. Nonetheless, it was also felt in such early stages 
of research for a new urban transportation system even the most fundamental decisions 
(i.e., the most appropriate study area and the most pervasive technological charac- 
teristics of the system must be stressed) did not have sufficient grounds for a sound 
answer. Thus, the team concluded that the most productive approach would have been 
an interplay between office research and system simulation on the one hand and actual 
experimentation on the other. 

Barring this interplay, the analyst would be advised to select an approach that uti- 
lized the most direct methods of analysis with the most feasible openness on the re- 
quired assumptions and presumptions necessary to be made in the process of forecast- 
ing. The market analysis approach yielded limited results and helped primarily to in- 
crease the understanding of a few critical segments of the potential market. The prod- 
uct clinic was also helpful to a very limited extent for several reasons, among which 
the difficulties the team encountered in bringing in larger crowds and in communicating 
the whole purpose of the project were prominent. Concerns over "salesmanship," 
"profits," and "propriety'' increased as the idea was transformed gradually from a 
pure speculative research notion into a physical, probable system. In overall terms, 
the transportation planning method that was based on past data from an origin and des- 
tination study and on findings of previous modal-split analysis proved to be the "'sav- 
ing method" that permitted the team to complete its efforts and to produce a set of 
forecasting estimates for limited use and some indication of their sensitivity to the im- 
portant system variables. This was so because of the availability of several central 
concepts in this approach and the availability of a plethora of partially relevant data. 

Among the numerous dilemmas that were faced in this effort, the ones related to the 
questions of Who will generate the system data? Who should make the critical assump- 
tions? and What analytical tools should be selected? were the ones that permeated the 
effort from its beginning to its conclusion. It seems that these questions are bound to be 
in the center of concern of any similar effort. No hurried answers would be advisable 
to these questions because they are only partly technical. Their essential nature is 
social, political, managerial or financial or all of these. Thus, in most cases, the 
answers must come primarily from the sector that carried the responsibility of im- 
plementation or the group that would either be the potential users of the system or feel 
the impact of the system otherwise or both. The researcher or the planning analyst 
in these cases should be satisfied to take the back seat and not to make the numerous 
decisions that predetermine so much of the answer. 
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TRAFFIC ASSIGNMENT WITHOUT BIAS 


Alvin H. Benesh, Federal Highway Administration, U.S. Department of Transportation, 
Pierre, South Dakota 


Traffic assignment procedures based on shortest time path tend to over- 
assign traffic to high-speed facilities. Because of this, 2 assignment for- 
mulas were developed that, taken as parametersinthe assignment process, 
eliminate the overall bias. One formula, known as the time-distance factor, 
was described in an earlier publication. The formula introduced in this 
paper, called the exponential factor, is based on a combination of link 
distance and speed raised to an exponential power. Both were tested by 
using data from 9 different studies in 6 states. When assignments were 
based on time, the mean overassignment to the high-speed facility was 45 
percent. At the optimum adjustment in the formulas this difference was 
eliminated, and the percent rms error was less than a third of the error 
in assignments by the shortest time path. A count of assignment errors in 
individual zone-to-zone movements revealed that in assignments by the 
shortest time path there were 18 times as many overassignmenterrors as 
underassignment errors. At the optimum assignments by the 2 formulas, 
the number of overassignment and underassignment errors was about in 
balance, and the total number of assignment errors was only about 60 per- 
cent of that based on assignments by shortest time. Also the traffic vol- 
umes assigned closely matched those actually counted. 


®BASIC to any procedure for assigning traffic to a network by the shortest path is the 
development of a measure or parameter that will in general route a trip to the correct 
highway. If the measure used gives close to an optimum assignment, then various 
calibration and restraint procedures to improve the assignment further will involve 
only very minor changes in speed. On the other hand, if the measure of the spatial 
relation in itself has a strong bias, then the procedures will involve much larger 
adjustments to overcome the bias. The result will be a coarser assignment with re- 
gard to actual traffic and speeds. 

In a previous paper (1) it was shown that the use of travel time for this measure 
results in too many trips being assigned to the high-speed facility. Conversely, the 
use of shortest distance gives too few trips to the high-speed facility. 

One of the purposes of this paper is to document more thoroughly those statements 
by using a greatly expanded amount of data and new research techniques and then, 
by applying these statistical techniques, to ascertain the proportion of time and distance 
in a time-distance (TD) factor previously described (1) that will give the best 
assignment. 

The principal purpose is the introduction of a new parameter, the exponential factor, 
for traffic assignment. It gives results quite similar to those obtained by use of the 
TD factor, but, because the formula involving the use of speed to an exponential power 
is completely different, it should prove of interest and eventually of considerable 
value to the transportation planning profession. 


THE TD FACTOR 


The formula for the link length for assignment purposes is based on a combination 
of time and distance, as follows: 


TD factor = TP + D (1.00 - P) 
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where 


T = time, min; 
D = distance, miles; and 
P = decimal fraction of weight assigned to time. 


When P = 1.0, the assignment is based on travel time; when P = 0, the assignment 

is based on distance. Between these 2 extremes, any value of P can be tried until one 
is found that will give the best results as determined by the root mean square error 
for the entire network. Once a value of P is decided on, the same constant value of it 
is used for the entire network assignment. 


THE EXPONENTIAL FACTOR 


The formula for the link length for traffic assignment purposes is based on a com- 
bination of distance and speed raised to an exponential power, as follows: 


Exponential factor = D(K/V)" 
where 


D = link distance, 
V = speed, and 
K = arbitrarily selected constant. 


It is believed that the value of the exponent N will never exceed the range of N = 0 to 
Ned, 

Suppose that V is expressed in miles per hour, Din miles, and K = 60. When N= 1; 
the expression becomes 60D/V, which is the formula for travel time in minutes. When 
N = 0 the fraction (K/V)° becomes 1 and the formula ends with only the link distance. 

For a sample computation of an exponential factor using an exponent of 0.5, assume 
that the times, distances, and resulting speeds between a pair of zones are as follows 


(1): 


Time Distance Speed 
Facility (min) (miles) (mph) 
Freeway 12.5 8.3 39.8 
Alternate 13.2 6.4 29.1 
Freeway exponential factor (N = 0.5) = 8.3 (60/39.8)°"° = 10.20 
Alternate exponential factor (N = 0.5) = 6.4 (60/29.1)°*° = 9.19 


Because the alternate route has the smallest assignment factor, it will be assigned the 
trips. 

The exponential factors for various values of N are shown in Figure 1. The data 
are plotted on full logarithmic scale so that the curves will be straight lines. Figure 
2 shows a comparison of the exponential factor and the TD factor. The exponential 
factor with N = 4 agrees quite closely with the TD factor using / T and 73D. In the 
exponential factor curves, the constant K had been set at 60 in order to give an assign- 
ment factor of 1 at 60 mph. (With N = 1, this also gives the curve for travel time.) 

As assignments are based on the relative link lengths, K could just as well be 1. If 
we make K = 1, the exponential factor becomes simplified to the following: 


Simplified exponential factor = D/Vy 


SOURCES OF DATA 


To test the formulas required the following for each zone-to-zone movement, which 
represents a group of trips, all originating in one zone and terminating in another: 
total number of trips, trips using freeway, travel time and distance via freeway, and 
travel time and distance via alternate route. Published information was available from 
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Figure 1. Comparison of exponential factors for Table 1. Trips between zones and on 
values of N. facilities. 
Trips Trips Zone-to- 
Between on Zone 
Facilit Z Facilit; Movements 
3 EXPONENTIAL | Baste ane Sn 
” 
Be 4 Urban freeway 
= FACTOR =o() Gulf 22,556 8,413 19 
$ Alvarado 92,278 23,856 154 
< Cabrillo 61,140 27,060 101 
Oceanside 6,904 2,823 53 
Shirley 19,756 8,152 88 
Small city bypass 
Kokomo 5,526 2,053 69 
Lebanon 6,110 4,464 50 
Rural Freeway 
Kansas 6,054 3,967 46 
South Dakota 7,388 4,388 88 
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Figure 2. Comparison of exponential and TD factors. 
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8 separate studies. In addition, an unpublished report on the Gulf Freeway in Houston 
was obtained from the Texas Highway Department. This made a total of 9 sources of 
data from 6 states, as follows: 


Urban Freeways 
Gulf Freeway, Houston (2) 
Alvarado Expressway, San Diego (3) 
Cabrillo Freeway, San Diego (3) 
Oceanside-Carlsbad Freeway, California (3) 
Shirley Highway, Arlington and Fairfax Counties, Virginia (4) 
Small City Bypasses 
Kokomo, Indiana (5) 
Lebanon, Indiana (5) 
Rural Freeways 
Interstate 70, Kansas (6) 
Interstate 29, South Dakota (7) 


METHOD OF ANALYSIS 


Data from these studies were used to write a computer program that provided a 
tabulation of the following data for each study: Number of zone-to-zone movements, 
number of trips actually using freeway, number of trips assigned to freeway, percent- 
age of zone-to-zone trips assigned to freeway, number of trips assigned to freeway as 
a percentage of actual trips using freeway, number of zone-to-zone movements erro- 
neously overassigned to freeway, number of zone-to-zone movements erroneously 
underassigned to freeway, and total number of zone-to-zone movements erroneously 
assigned. The actual and assigned trips are self-explanatory. The erroneously 
assigned trips can be described as follows. 

In an assignment by shortest path method, there are only 2 assignments possible. 
Either all zone-to-zone trips are assigned to the freeway, or no trips are assigned to 
it. For example, suppose that 49 percent of the trips represented in a zone-to-zone 
movement actually use the freeway. Then the best possible assignment to the freeway 
by this procedure would be 0 trips, and such an assignment would be considered cor- 
rect. However, if all of the trips were assigned to the freeway, the assignment would 
be incorrect and the computer would classify the assignment as a plus error. Examples 
of the assignment classifications that are possible for the percentages of zone-to- 
zone trips actually using the freeway and assigned to the freeway are as follows: 


Actual Assigned Classification 
49 and under 0 Correct 
49 and under 100 Plus error 
51 and over 0 Minus error 
51 and over 100 Correct 


The assignment data were computed and tabulated in increments of 0.1 for the value 
of P in the TD factor and of N in the exponential factor. This gave 11 assignments for 
each factor for each study. In addition, in the range from 0.46 to 0.24, assignments 
were made in increments of 0.02, which improved accuracy in the optimum range. 


ANALYSIS RESULTS 


Total Traffic Assigned 


As indicated in the earlier report on the TD factor, all assignments to the higher 
type of facility were too high by the shortest time path and too low by the shortest 
distance. The percentages for each of the 9 highways are given in Tables 2 and 3. 

The percentages for each highway at the optimum exponential factor and TD factor for 
the group are also given. Additional detail on number of trips assigned is given in 
Table 1. The tables give assignments at intervals of 0.2 in values of N and P, plus the 


Figure 3. Percentage of actual trips assigned by exponential 
factor for values of N in intervals of 0.1. 
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Figure 4. Percentage of actual trips assigned by TD factor for 
values of P in intervals of 0.1. 
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Table 2. Percentage of actual trips assigned by 
exponential factor for values of N in intervals of 0.2. 


Facility 1.0° 0.8 0.6 0.46 0.4 0.2 0.0° 
Urban freeway 

Gulf 159 141 114 81 B1 56 4 

Alvarado 183 144 119 101 90 GO 42 

Cabrillo 161 149-133 94 BO 69 54 

Oceanside 1446 «#140 «#1400 «115 113° 7558 

Shirley 114105 75 72 7 5548 
Small city bypass 

Kokomo 169 «159 148 «104 100 96 86939 

Lebanon 120 «102 101 96 95 95 4 
Rural freeway 

Kansas 125 124 123 108 108 89 = 82 

South Dakota 126 «©1250 «118 «IT 116 «©9081 
Mean 145 9132 119 99 95 76 50 
“Shortest time. "Optimum. "Shortest distance. 


Table 3. Percentage of actual trips assigned by TD 
factor for values of P in intervals of 0.2. 


Facility 1.0 860.8 0.6 0.4 0.32 0,2 0.0° 
Urban freeway 
Gulf 159 159141 114 81 69 AL 
Alvarado 183 4152 134 119 90 66 42 
Cabrillo 161 153 148 «111 97 17 54 
Oceanside 146 140 140 140 113 106) = 58 
Shirley 114109 104 82 712 5B (48 
Small city bypass 
Kokomo 169 166 156 141 115 98 39 
Lebanon 120 «#112 = #=«6101 101 101 95 4 
Rural freeway 
Kansas 125 «124 123 108 107 89 82 
South Dakota 126 125 118 117 115 94 «81 
Mean 145 #138 129 115 95 84 «50 


“Shortest time. 


Shortest distance. 
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assignments at the optimum values of N and P. Computations, however, were made 

at intervals of 0.1, and plots of the assignment data for the exponential factor and 

the TD factor on this basis are shown in Figures 3 and 4. The number of lines, one 
for each study, tends to slightly obscure the basic trend; therefore, Figures 5 and 6 
show the mean values with boundary lines at 1 standard deviation from the mean. 
Assignments of 99 percent of the observed number of trips are obtained at a value of 

N = 0.46 in the exponential factor and P = 0.32 in the TD factor. As N and P were 
computed at intervals of 0.02 in the optimum range, the 99 percent figure is the closest 
one based on an actual assignment. At N and P values that give an average assignment 
close to the actual volumes, the dispersion about the mean, as measured by the stan- 
dard deviation, is about a third less than the dispersion when assignments are by the 
shortest time path. 


Root Mean Square Error 
The formula for the computation of the rms error is as follows: 


rms =V [2D (Kee = XP V(N = 1) 


where 
Xec = total vehicles counted on a given route, 
X, = assigned volume to the route, and 
N = number of routes. 


It is possible to interpret the results of the computation given above better in this case 
if the reader can review a sample computation. The following one is for shortest time 
path. 


Actual Assigned 


Facility Trips Trips 

Facility Type (percent) (percent) Difference Difference” 
Gulf Freeway 37 59 -22 484 
Alternate 63 41 +22 484 
Alvarado Freeway 26 47 -21 441 
Alternate 74 53 +21 441 
Cabrillo Freeway 44 Tl -27 729 
Alternate 56 29 +27 729 
Oceanside Freeway 41 60 -19 361 
Alternate 59 40 +19 361 
Shirley Freeway 41 AT -6 36 
Alternate 59 53 +6 36 
Kokomo Bypass 37 63 -26 676 
Alternate 63 37 +26 676 
Lebanon Bypass 73 87 -14 196 
Alternate 27 13 +14 196 
Kansas Freeway 66 82 -16 256 
Alternate 34 18 +16 256 
South Freeway 59 75 -16 256 
Dakota Alternate 41 25 +16 256 
Total 900 900 0 6,870 


Total alternates and freeways = 18 

Average trips on each, percent = 900/18 = 50 
rms error = V6,870/17 = 20.1 

Percent rms error = (20.1 x 100)/50 = 40.2 
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Figure 5. Mean percentage of actual trips assigned by exponential factor for 


values of N in intervals of 0.1. 
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Figure 6. Mean percentage of actual trips assigned by TD 
factor for values of P in intervals of 0.1. 
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Figure 7 shows rms errors for the exponential factor and TD factor. In the optimum 
range for each method, the percent rms error is less than a third of the error in 
assignments by the shortest time path. 


Movements Incorrectly Assigned 


This analysis is probably the most significant part of the report. Results based on 
route totals alone do not take into account the variation in accuracy that is possible 
from one end of the route to the other. A count of the number of zone-to-zone move- 
ments incorrectly assigned, classified according to the number of plus errors and 
minus errors, gives a better insight into the workings of an assignment procedure. 

Of course, in an all-or-nothing assignment there are only 2 possibilities: either 0 
percent or 100 percent of the trips represented in a zone-to-zone movement are 
assigned to the facility. In this analysis, therefore, the assignment that comes closest 
to the actual is considered the correct one, even though numerically it may be consid- 
erably different from the actual. Thus, if 37 percent of trips represented in a zone-to- 
zone movement actually uses a freeway, an assignment of 0 percent would be correct, 
and an assignment of 100 percent would represent a plus error. If 80 percent of the 
trips in a zone-to-zone movement actually uses a freeway, an assignment of 0 percent 
would represent a minus error, and an assignment of 100 percent would be considered 
correct. 

Tables 4 and 5 give the percentage of plus and minus errors for different values of 
N and P for each of the 9 facilities in this study. In assignments by shortest time 
nearly all of the errors are plus errors, and in assignments by shortest distance most 
of the errors are minus errors except on 1 facility. Stated in another way, in assign- 
ments by shortest time there are 18 times as many plus errors as minus errors, and 
in assignments by shortest distance there are 8 times as many minus errors as plus 
errors. These facts are considered by the author to be of great statistical significance. 
At some point between the 2 extremes, where the number of plus and minus errors is 
about in balance, the total number of errors averages only about 60 percent of that 
based on assignment by shortest time and the traffic volumes assigned closely match 
the actual traffic counts. The trend in percentage of assignment errors for various 
values of N and P is shown in Figures 9 and 10. Figure 11 shows a comparison of 
assignments by shortest time and shortest distance and the optimum exponential and TD 
factors. For the latter two, the volumes assigned were each 99 percent of the actual 
number of trips. 


APPLICATION TO NETWORK ASSIGNMENT 


Most computer programs are written for assignments by shortest time path. All 
that would be necessary for these procedures to be applied would be to substitute the 
formula for exponential factor or TD factor in the program for the computation for 
travel time in determining the link spacing for traffic assignment. The program should 
be written so that the value of N or P could be easily changed between assignment runs 
to get the best assignment. 

There is in existence at the present time a program for the TD factor. It was pre- 
pared in 1970 and is now a part of the standard FHWA urban transportation program 
system battery for the IBM 360 computer. The TD factor, in principle, will be found 
in the instructions for the weight card (WEIGHT) in the program deck BUILDVN, al- 
though the TD factor as such is not mentioned. It can be developed by assigning a weight 
to time and to distance in accordance with the instructions for using the card. 

The computation of network root mean square error is now available in assignment 
programs. This measure would appear to be a logical one in determining the optimum 
value of N or P for the best assignment. After the optimum assignment by this pro- 
cedure is reached, some further improvement in the calibration is possible by an 
iterative process. However, because assigned traffic volumes will probably already 
be close to the actual, it would appear that the size of subsequent speed adjustments 
per iteration should be set somewhat lower than usual when assignments are made on 
the basis of travel time. 
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Figure 10. Percentage of zone-to-zone movements incorrectly assigned for selected values of N and P. 
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Before adopting these procedures, the planner may wish to run some test assign- 
ments using the exponential factor or TD factor. In doing so, he will probably select 
for the tests some readily available input data from a network previously assigned. 

In this event, he is cautioned to be sure that the speeds used are objectively measured 
speeds that have not been revised or adjusted prior to or as a part of the previous 
assignment runs. A number of planners realize that when assignments are by shortest 
time, the measured speeds give too high an assignment to high-speed facilities. As 

a result some of them may have either subconsciously or systematically reduced input 
speeds of the high-speed facilities relative to those of low speed. This may even have 
been done by changing the definition of the speed measurement for each type of facility. 
If such an adjustment in input has been made, the exponential factor or TD factor will 
not give satisfactory results. Some typical changes that may have been made that the 
planner should look for are 


1. Selective manual speed reductions on some high-speed facilities, usually free- 
ways; 

2. Use of peak-hour speeds on freeways and off-peak speeds on other facilities; 

3. Manual upward adjustment of arterial speeds in the CBD; and 

4. Use of average speeds on rural freeways and 85 percentile speeds on other 
rural arterials. 


The discussion given above has related strictly to assignment of trips, without re- 
gard to how the trips are distributed. The distribution could be the origin-destination 
data from the original survey, or it could be based on a gravity or other model. If 
the distribution is by a gravity model, some error may be introduced by using travel 
time asa basis. Logically, better results would be possible by using the optimum TD 
or exponential factor in the model instead of travel time. 

When the exponential factor is compared with the TD factor, it is apparent that al- 
though both give strikingly better assignments than that given by the shortest time path 
the difference between the optimum assignment by each is not great enough to be con- 
clusive. Either can be used with confidence, and perhaps after several full-scale 
assignments on complete networks a preference will gradually develop for one or the 
other. 


ACKNOWLEDGMENT 


The opinions, findings, and conclusions expressed in this paper are those of the 
author and not necessarily those of the Federal Highway Administration. Sincere 
appreciation is extended to the Research and Planning Division of the South Dakota 
Department of Highways for preparing the charts and for excellent cooperation in pro- 
viding computer services through the Computer Services Division. Also I wish to 
thank Charles W. Chappell, Jr., of the Federal Highway Administration for his un- 
usually conscientious work in writing the computer program and following through on 
the task to the final computer tabulations. Acknowledgment is also made to the Texas 
Highway Department for furnishing the Gulf Freeway data. 


REFERENCES 


1. Benesh, A. H, Traffic Assignment by the Shortest Path Method Using the TD Fac- 

tor. Traffic Quarterly, Oct. 1967, pp. 553-567. 

Traffic Assignment to the Gulf Freeway. Texas Highway Department, Dec. 1954. 

Moskowitz, K. California Method of Assigning Diverted Traffic to Proposed Free- 

ways. HRB Bull. 130, 1956, pp. 1-26. 

4. Trueblood, D. L. Effect of Travel Time and Distance on Freeway Usage. HRB 
Bull. 61, 1952, pp. 18-37. 

5. May, A. D., Jr., and Michael, H. L. Allocation of Traffic to Bypasses. HRB 
Bull. 61, 1952, pp. 38-58. 

6. Kansas Interstate 70 Usage Study. State Highway Commission of Kansas, Summary 
Rept., 1962-1965. 

7. South Dakota Interstate Highway 29 Usage Study. South Dakota Department of High- 
ways, 1970. 


ow hb 


SEGMENTED, MULTIMODAL, INTERCITY 
PASSENGER DEMAND MODEL 
John W. Billheimer, Stanford Research Institute, Menlo Park, California 


This report documents the specification, calibration, and performance of a 
segmented mathematical model developed to predict intercity travel by 
mode within and around the state of Michigan. The performance of dif- 
ferent existing demand models is studied, and a model is formulated that 
uses income data and cost, time, and frequency of modal service to predict 
the travel volumes linking a wide range of city sizes. Model parameters 
are estimated by using a constrained-search calibration technique. Model 
performance is documented, and the sensitivity of this performance to 
changes in input data and calibration parameters is discussed. 


eMATHEMATICAL models developed to predict intercity passenger demand have typi- 
cally focused on travel between densely populated urban areas. Models developed and 
calibrated in this fashion generally fare poorly in predicting travel demand between 
sparsely populated areas and between a large city and a smaller urban area. Yet, state- 
wide transportation planning entails the forecasting of traffic between cities of any size. 
This report describes the development of a segmented mathematical model designed to 
represent the demand for travel between cities of any size in Michigan. 


MODEL SPECIFICATION 


Range of Performance 


Michigan covers an area of 58,216 square miles. Its populated areas range in size 
and type from isolated rural hamlets to the heavily industrialized Detroit area. An in- 
tercity passenger demand model designed for use in the state must be capable of pre- 
dicting traffic by mode between cities of widely varying population densities separated 
by distances ranging from 50 to 600 miles. To assist in the formulation and calibration 
of a model having this capability, a set of 15 sample origin-destination pairs was 
selected to reflect the range of population-distance combinations existing in and around 
Michigan. These sample pairs are given in Table 1. The designations given in Table 1 
are as follows: 


Item Designation 

Population 

20,000 to 50,000 Small 

200,000 to 1,000,000 Medium 

2,000,000 to 5,000,000 Large 
Distance, miles 

50 to 100 Short 

150 to 250 Medium 

400 to 600 Long 
City 

Alpena ALP 

Sault Sainte Marie SSM 

Houghton HOU 

Flint FLI 

Caro CAR 
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Ttem Designation 
City 

Detroit DET 
Kalamazoo KAL 
Columbus (Ohio) COT, 
Des Moines DES 
Chicago CHI 

Philadelphia PHL 


Performance of Existing Models 


As a first step toward selecting a model for use in predicting Michigan intercity 
traffic, the 15 sample origin-destination pairs were used to test several existing uni- 
modal and multimodal demand models. A list ofthe models is given inTable 2. Details 
regarding the structure of the models may be found in the indicated references. All of 
the models represent traffic between zones as a function of trip generation character- 
istics and some measure of interzonal impedance. As such, all of the models are de- 
scendants of the gravity model. 

Table 3 gives the results of using each of the demand models to predict traffic be- 
tween the city pairs. This table also gives actual measurements of intercity automobile 
traffic as compiled by the Michigan Department of State Highways in studies conducted 
between 1964 and 1968. A comparison of predicted and observed values shows that ex- 
isting models generally cannot cope with the range of city sizes and distances to be 
found in Michigan. As might be expected, model 4, developed and calibrated specifically 
for use in the state, does the best job of reproducing actual traffic measurements, 
although it appears to overstate traffic between cities separated by short distances. 
Unfortunately, this model is limited to the prediction of automobile traffic. 

The performance of the remaining models hardly can be termed promising. Models 
1 and 2, both classic gravity models, perform poorly when uprooted from their places 
of calibration and applied to the range of city pairs existing in Michigan. Model 3, which 
woefully understates traffic, might profit slightly from a change of coefficients. Model 
5, calibrated on the large cities of the Northeast Corridor, performs poorly in estimating 
traffic between the smaller Michigan cities. 

The discrepancies between predicted and actual values given in Table 3 highlight the 
difficulty of predicting intercity passenger demand for a wide range of distances and 
city sizes. This difficulty is multiplied by the problem of designating modal preferences 
of passengers. No gravity model exists that can be pulled off the shelf and used with con- 
fidence to predict travel patterns in any arbitrary area. In this sense, the term gravity 
model, implying as it does an immutable law, is amisnomer. Isaac Newton himself 
might have had second thoughts about the validity of his gravity model had it been neces- 
sary to reformulate it for different masses and recalibrate it for different points on the 
earth's surface. 


Model Selection 


The ability to reproduce observed travel data with a reasonable degree of fidelity 
for the range of city sizes and separations encountered in Michigan was but one of the 
criteria considered in selecting an intercity demand model. In addition to this im- 
portant consideration, it was desired that the model have the following attributes: sim- 
plicity, sound theoretical structure, and ability to reflect the intermodal consequences 
of system changes, Because each of the tested models failed to reproduce modal pref- 
erences for the range of sample city sizes, the selection process centered on these 
additional attributes. Once a model having these attributes was identified, an attempt 
was made to extend its range of applicability to include the city sizes of interest in 
Michigan. 

A review of existing intercity demand models led to the selection of the basic model 
developed by McLynn (5), modified by the National Bureau of Standards (6), and sum- 
marized by the Northeast Corridor Transportation Project (7). The variables con- 


Table 1. Sample origin-destination city pairs. 


Distance 
City Size Short Medium Long 
Small to small ALP-SSM SSM-HOU 
Small to medium ALP-FLI SSM-FLI 
Small to large CAR-DET SSM-DET HOU-DET 
Medium to medium FLI-KAL FLI-COL FLI-DES 
Medium to large FLI-DET FLI-CHI FLI-PHL 
Large to large DET-CHI DET-PHL 
Table 2. Models used to test city pairs. 

Calibration Modal-Split 

Model Developer Area Capability Reference 
{ Unknown Detroit Single 1 
2 Stanford Research Institute California Multiple 2 
3 Wilbur Smith Illinois Single 3 
4 Arthur D. Little Michigan Single 4 
5 Office of High Speed Ground Northeast Multiple 5 


Transportation, U.S. De- 


Corridor 


partment of Transportation 


Table 3. Demand model predictions of daily 1-way person trips by 


automobile. 


City Actual 
Pair Model 1 Model 2 Model 3 Model 4 Model 5 Traffic 
ALP-SSM 1 2 1 22 0 19 
SSM- HOU 0 1 0 bg 0 11 
ALP-FLI a 17 2 29 1 27 
SSM-FLI 1 6 0 14 1 51 
CAR-DET 280 1,837 138 910 51 660 
SSM-DET 8 46 0 62 4 274 
HOU-DET 3 15 0 60 1 62 
FLI-KAL 56 327 31 78 29 58 
FLI-COL 37 231 1 n. a. 33 16 
FLI-DES 2 8 0 n.a. 2 3 
FLI-DET 7,021 59,378 1,302 24,859 2,877 14, 600 
FLI-CHI 262 2,032 1 127 148 77 
FLI-PHL 22 151 0 n.a. 14 5 
DET-CHI 2,635 25,732 a 597 1,391 775 
DET-PHL 279 2,461 0 n.a. 161 74 


Note: 1960 population data were used, 
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sidered by the model in determining the traffic by a mode m between origin-destination 
pair (i, j) are identified as follows: 


t, = total (i to j) travel time for the mth mode, hours; 

c, = total (i to j) out-of-pocket per capita cost, dollars; 

f, = frequency of (i to j) service, trips per day; and 

F = number of families with annual incomes exceeding $10,000 (families x 10~) in 
the SMSA or county of the origin or destination city. 


These variables can be used to define the modified demand model by the following re- 
lations: 


a@(1) a(2) 7 ae a(a) 
at; 7 [1 - exp (-Kf,)] for m # automobile 


: (1) 


ie (c,/1.7)2 for m = automobile 


(2) 


BO)(F,F,)°" we? for F,F,>G 


Dz (3) 


Ba) B(2) 


B'(0)(F,F,)” W for F,F, <G 


D, = Dw,/W (4) 


The terms w, and W may be regarded as modal conductance and total (i to j) conduc- 
tance respectively. D, and D are daily one-directional modal (i to j) demand and total 
(i to j) demand respectively (measured in persons). 

So that the model could be adapted to the wide range of city sizes of interest in 
Michigan transportation studies, the demand model was segmented as indicated in Eq. 3. 
Thus, city pairs having population products F,F, below a specified value G received a 
treatment different from that received by city pairs having larger population products. 


Data Development 


In the calibration of the demand model, numerical values were assigned to each of 
the model's parameters, and the effect of each assigned value on the model's ability to 
reproduce actual travel data was observed. The basic demand data used in this cali- 
bration process consisted of observed 1-way travel volumes by air, rail, bus, and auto- 
mobile between each of 20 origin-destination pairs for the base year 1967. The 20 
origin-destination pairs were the 15 city pairs given in Table 1 and the following 5 
additional pairs: Detroit-Cleveland (CLE), Detroit- Pittsburgh (PIT), Detroit-Milwaukee 
(MIL), Flint-Cleveland, and Flint-Milwaukee. These pairs were added to broaden the 
data base and to place additional emphasis on travel between larger cities. 

The cost, time, and frequency of common carrier service between each pair of cities 
were obtained from published schedules, and access times and costs were computed for 
each city. In the calculation of automobile costs and times, operating costs of 4 cents/ 
mile were assumed and average speeds of 60, 30, and 15 mph were associated with free- 
ways, arterials, and local streets. An average automobile occupancy of 1.7 persons/ 
vehicle was assumed. Census data from 1960 were extrapolated to 1967 in the estima- 
tion of the number of families in each origin or destination zone having a real income 
exceeding $10,000/year. 
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Calibration Technique 


Attempts to use a series of log-linear regression analyses to calibrate the model 
formulated in Eqs. 1 through 4 proved unsuccessful. Part of the explanation for this 
lack of success may be traced to the failure of the log-linear regression format to deal 
adequately with the range of city sizes under consideration. 

In lieu of regression analysis, the Michigan intercity passenger demand model was 
calibrated by means of a constrained-search technique. Through a combination of past 
experience and a knowledge of the model's structure, upper and lower bounds were set 
on acceptable values of each model parameter. A limited search was undertaken within 
those constraints for the combination of parameters that minimized a series of error 
functions describing model performance. The parameter bounds and error functions 
used in this constrained-search calibration process are described below. 

Parameter Bounds—The following logical bounds were imposed on the model parame- 
ters in advance of the calibration process: 


0 < B’(0) < B(0) 

0< p(1) < B(1) < 1.1 
0<p(2) <1 

-5 <a(j)<0;j=1, 2,4,5 
a(3) = 0.3247 

k= 0/12 

Osa, <5 


The model's consistency of behavior was ensured by imposing a positive or a nega- 
tive constraint on each parameter. In addition, the positively constrained parameters 
B’(1), B(1), and B(2) each had logical upper bounds. Experience with gravity models 
has shown that the exponent 8(1) associated with the population product rarely exceeds 
1.1. Were this exponent to be higher, population increases would have a disproportionate 
effect on predicted travel demand. Furthermore, the exponent £’(1) associated with 
small-city pairs cannot exceed the large-city exponent 8(1), This relation is indicated 
by empirical data relating intercity travel to population product for the sample city 
pairs. 

Consideration of the conductance exponent g(2) shows that the value of this exponent 
cannot exceed unity. Otherwise, a decrease in the time or cost of travel by 1 mode 
could cause corresponding increases in travel by competing modes. This can be shown 
by considering that, for small changes in time or cost, demand changes may be expressed 
as a function of the partial derivative of demand with respect to the changing variable. 

If the cost c, of travel by mode m between 2 cities were to be changed, the effect ona 
competing mode n can be represented as follows: 


AD, = (8 Dn/8ca) Ac, 


(5) 


AD, = [a(2)/cx] [8(2) - 1] D:(ws/W) Ac, 


where AD, represents a small change in demand D, for a competing mode n, and Ac, 
represents a small change in cost c,. Thus, the intermodal effects predicted by the 
proposed demand model will remain consistent only as long as A(2) = 1. 

Model consistency also demands that the modal conductance exponents a(1), a(2), 
a(4), and w(5), associated with time and cost, be negative. If these exponents are al- 
lowed to become too large, however, small changes of time or cost will have a dis- 
proportionate effect on demand. If a(2) < -5, for example, sensitivity analysis shows 
that a 10 percent decrease in the cost of mode m could cause more than a 50 percent 
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increase in the demand for that mode. Accordingly, a lower limit of -5 was placed on 
exponents a(1), (2), «(3), and «(4) to forestall such unlikely results. 

In calibrating his basic demand model, McLynn (5) empirically set K= 0.12. This 
value was used in the segmented model, as was the McLynn-calibrated value a(3) = 
0.3247. An upper limit of 5 was placed on the common carrier conductance multiplier 
a,, because it was felt that larger values of a, would create unrealistic imbalances be- 
tween common carrier traffic and automobile traffic. 

Error Functions—As the parameter values were varied within the established bounds, 
different error functions were computed and monitored to determine the overall effect 
of each parameter on the demand model's ability to reproduce observed travel data. 
These error functions reflected (a) the square root of the sum of the squares of the dif- 
ferences between calculated and observed modal travel values; (b) the sum of the ab- 
solute values of the difference between calculated and observed modal travel values; 
and (c) the number of calculated travel values that fell outside a predetermined range 
surrounding the observed value. Range settings within 10, 25, and 50 percent of the 
observed demand were monitored in the calibration process. 

Calibration Procedures—In the calibration of the segmented demand model, attention 
was first directed toward the determination of the parameters 8(0) and B(1), which were 
associated with larger-city pairs. Once these parameters were fixed, the search for 
B‘(0) and 6‘(1) was undertaken. In the case of larger-city pairs, the constrained-search 
calibration procedure followed the steps outlined below. 


Set B(0) = a, = 1; 

Select values for B(1) and B(2); 

Select values for «(1), a(2), (4), and (5); 

Compute D,, for each city pair; 

Compute error functions; 

Adjust a, to approximate modal-split proportions; 

Adjust 8(0) to minimize error functions; and 

Return to step 3 and try another combination of «(i), repeat until no further im- 
provement in the error functions appears possible for the combination of B(1) and B(2) 
selected in step 2, and then try another combination of 8(1) and A(2). 


SBABARWNrH 


In the actual calibration process, 8(1) and 8(2) were varied in increments of 0.1 until 
a combination was found that appeared to fit the observed travel data associated with 
large-city pairs. At this point, 8(0), 8(1), and 8(2) were fixed, and a search was under- 
taken for appropriate values of B’(0) and 6(1). 


Calibration Results 


The calibration process given above resulted in the identification of the following 
parameter values: 


1.5 for m= air 


r) 
tl 


0.75 for m = bus, rail 
a(i) = a(2)=-1.5 
o(3) = 0.3247; K = 0.12 
a(4) = w(5) = -1.8 
B(0) = 25,000; B’(0) = 2,500 
B(1) = 1.0; p(1)=0.1 
B(2) = 0.9 
0.075 


i?) 
" 
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Table 4 gives a comparison of the demand calculated through the use of the parame- 
ters and the observed travel between each of the 20 city pairs. Although the overall 
agreement between calculated and observed values is satisfactory, the demand model 
severely understates travel between a city pair consisting of 1 small city and 1 large 
city. The reason for this understatement is shown clearly in Figure 1, where normal- 
ized demand is plotted as a function of population product. Normalized demand is de- 
fined as follows: 


B(2) 


Desawuiiisea ae D/W 


This normalization process removes the effect of travel impedance from the demand 
term so that the resulting normalized demand should be a piece-wise log-linear func- 
tion of the trip attraction measure, the income product F,F,. Figure 1 shows that the 
normalized demand between all city pairs except Sault Sainte Marie-Detroit, Houghton- 
Detroit, Sault Sainte Marie-Flint, and Caro-Detroit clusters closely about the log-linear 
form defined by the calibration process. It would appear to be impossible to use the 
chosen model effectively to represent travel between each of these 4 city pairs without 
destroying the model's ability to reproduce the remainder of Michigan's intercity traf- 
fic. There seems to be nothing within the framework of the mathematical model to ex- 
plain, for instance, why automobile traffic between Detroit and Sault Sainte Marie should 
be nearly double the combined traffic between Detroit and the larger, closer cities of 
Pittsburgh and Milwaukee. 

In addition to highlighting data inconsistencies, Figure 1 clearly shows the need for 
segmenting the Michigan intercity demand model. The data points plotted in this figure 
make it plain that a single log-linear function cannot reflect travel demand between city 
pairs of all sizes. 


MODEL SENSITIVITY 


Effect of Variable Changes 


One test of the soundness of a demand model is its ability to behave logically in the 
face of changes in input variables. Because the Michigan intercity passenger demand 
model is a closed-form mathematical expression, its sensitivity to variable changes 
may be determined analytically. The first partial derivative of demand with respect 
to each input variable, 3D/sV, provides a measure of this sensitivity and, by inference, 
also provides a measure of the impact of each variable on intercity demand. 

The value of 3D/dV associated with each model input variable was computed and 
used to assess the effects of small (10 percent) changes in each variable on model de- 
mand and total intercity travel. Table 5 gives the results of this assessment. For 
large-city pairs, a 10 percent increase in the number of families in 1 city earning more 
than $10,000/year will increase travel demand by 10 percent across all modes. For 
small cities, an equivalent percentage increase will result in only a 1 percent increase 
in total travel. Although these differences in the modeled effect of population changes 
may be valid for extremely large cities and extremely small cities, it is illogical to 
expect such dichotomous behavior in the case of medium-sized cities. The abrupt 
transition from a 1 percent to a 10 percent increase in travel experienced when the in- 
come product F,F, exceeds G = 0.075 might be smoothed by replacing the segmented 
demand model with a continous function. 

The effects of small changes in the model input variables, time, cost, and frequency, 
vary with the importance of the individual mode in intercity travel. If mode m domi- 
nates intercity travel (i.e., if w,/W is nearly unity for mode m), the effects of modal 
changes on total intercity demand are maximized. Conversely, small changes in in- 
frequently used modes (modes for which w,/W is vanishingly small) will have slight 
effect on total intercity demand. 

Data given in Table 5 show that a 10 percent increase in the cost of travel by com- 
mon carrier between 2 cities might cause a decrease of 13.5 percent in the total travel 
demand between those cities if common carrier is the prevalent mode of intercity travel. 


Table 4. Calculated and observed values. 


City 
Pair 


ALP-SSM 
SSM -HOU 
ALP-FLI 
SSM-FLI 
CAR-DET 
SSM-DET 
HOU-DET 
FLI-KAL 
FLI-COL 
FLI-DES 
FLI-DET 
FLI-CHI 
FLI-PHL 
DET-CHI 
DET-PHL 
DET-CLE 
DET-PIT 
DET-MIL 
FLI-CLE 
FLI-MIL 


Air Rail Bus Automobile Total 
Cal. Obs. Cal. Obs. Cal. Obs. Cal. Obs. Cal. Obs. 
0 0 0 0 0 1 14 19 14 20 
0 0 0 0 0 0 3 11 3 11 
0 0 0 0 2 2 13 27 15 29 
0 0 0 0 1 5 3 51 4 56 
3 0 0 0 4 20 70 660 76 680 
1 5 0 0 1 10 3 274 4 289 
1 7 0 0 0 0 1 62 1 69 
0 0 3 3 14 25 60 58 77 86 
4 0 1 2 1 2 13 16 19 20 
1 1 0 0 0 1 0 3 2 5 
51 9 55 30 269 250 4,096 4,618 4,470 4,907 
29 31 4 5 9 20 59 77 101 133 
14 4 0 0 1 2 2 5 18 ll 
660 631 97 80 149 150 802 7175 1,708 1,636 
155 251 5 5 19 20 40 74 220 350 
332 137 41 3 94 25 650 572 = 1,117 737 
188 139 34 2 62 10 127 103 411 254 
116 134 16 2 28 10 42 41 202 187 
25 9 0 1 4 4 26 22 55 36 
3 3 1 0 2 1 4 4 10 8 


Figure 1. Normalized demand versus income product. 
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Table 5. Model sensitivity to 10 percent increase in input variables. 


Variable 
F 
t. 
Cy 


f. 


Case 


F,F,; > 0.075 

F,F; < 0.075 

m # automobile 
m = automobile 
m / automobile 
m = automobile 
m # automobile 


Change in Modal 
Demand (percent) 


Change in Total 
Demand (percent) 


Minimum Maximum 


Minimum Maximum 
10.0 10.0 10.0 10.0 
1.0 1.0 1.0 1.0 
-13.5 -15.0 0.0 -13.5 
-16.2 -18.0 0.0 -16.2 
-13.5 -15.0 0.0 -13.5 
-16,2 -18.0 0.0 -16.2 
0.0 3.9 0.0 2.6 
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If automobile is the prevalent mode of intercity travel, the effect of such a cost increase 
on total intercity travel would be negligible. The effect of the fare increase on travel 
via the affected mode would be a loss of between 13.5 and 15 percent of the mode's pre- 
increase travel volume. Similar ranges would be expected in the event of a 10 percent 
increase in travel time. These ranges are given in Table 5 along with the correspond- 
ing ranges for changes in the time and cost of automobile travel. The magnitude of 
these small changes does not appear to be unreasonable, and, thanks to the constraints 
imposed in the calibration procedure, the direction of change is proper. 

Service frequency is the least effective of the input variables in terms of its ability 
to influence sizable demand changes. A 10 percent increase in common carrier service 
frequency can effect no more than a 3.9 percent in modal patronage and no more than a 
2.6 percent increase in total intercity travel. 


Effect of Parameter Changes 


Just as the effect of variable changes on predicted demand gives a measure of model 
reasonability, so the effect of parameter variations on demand gives a measure of model 
stability. If a small parameter change can drastically alter model output, the calibra- 
tion procedure is complicated, and model validity may be suspect. 

The effect of small parameter changes on total demand is quite complex and may de- 
pend on the relative impact of a mode on intercity travel; on the existing population 
product; on current levels of time, cost, and frequency; or on all of these factors. An 
evaluation of 3D/aP, the first derivative of demand with respect to each model parame- 
ter, shows that the parameters whose changes have the greatest potential impact on de- 
mand are the time and cost components a(1), ~(2), a(4), and w(5) and the conductance 
exponent £(2). 

The segmenting of the demand model buffers the effect of changes in the income ex- 
ponent B(1). Were it not for this segmentation, a 10 percent change in the parameter 
B(1) could effect a 90 percent change in the demand calculated between a small-city 
pair. This buffering effect suggests that model stability and performance might be 
improved by similarly segmenting the model with respect to the conductance exponent 
B(2). Such a segmentation would buffer the potentially pronounced effect of changes in 
the modal time and cost exponents. 


INDUCED AND DIVERTED DEMAND 


When improvements in a single mode cause an incremental increase in the number 
of travelers using that mode, these travelers can be assumed to come from 1 of 2 
sources: (a) other modes (diverted demand) or (b) the pool of potential travelers who 
currently are not included in the total intercity demand (induced demand). Thus, total 
modal increases are made up of travelers diverted from other modes and travelers in- 
duced to make the intercity journey for the first time (or more often). Although the 
calibrated demand model behaves logically in reproducing the overall impact of variable 
changes, numerical results of a number of model runs revealed that the model clearly 
overstates induced demand at the expense of diverted demand. 

The reason for this overstatement becomes clear if the sources of incremental de- 
mand increases are investigated. Equation 5, repeated here for the sake of convenience, 
expresses the effect of an incremental cost change in mode m on a competing mode n. 


AD, = [a(2)/c,][8(2) - 1] (w,/W) D,ac, forn/m 


The effect of the cost change on the demand for service via mode m is as follows: 


AD, = [a(2)/e,] D, [1+ (w,/W) [B(2)- 1]} ac, (6) 
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Summing the expression given above across all modes gives the total intercity demand 
increment. 


AD = AD,+ 4B AD, (7) 
n#m 
= [e(2)/c,] B(2)D,ac, (8) 


In the case of a cost decrease, the constrained calibration procedure forces aD, to be 
positive and AD, to be negative. Hence, the total demand increment AD will represent 
the total induced demand. The ratio of induced demand to the incremental demand in- 
crease via mode m may be found as follows: 


AD/AD, = [8(2)]/{1+ (w,/W) [A(2)- 1)) (9) 


For the calibrated value of 8(2) = 0.9, this ratio will vary from 0.9 to 1.0 as the ratio 
w,/W varies from 0 to 1. Thus, the induced demand component of traffic increases 
predicted by the intercity demand model will range between 90 and 100 percent. This 
is not a realistic state of affairs. The model's realism may be improved, however, by 
defining arbitrarily a more reasonable limit on induced demand and redistributing de- 
mand forecasts in accordance with this limit. A simple means of accomplishing this 
redistribution is to let 


= (Do + yAD) (w,/W) (10) 


where Do represents original intercity demand and y represents an arbitrary scaling 
factor (0< y< 1). 


FUTURE WORK 


Model Improvements 


Future work to improve the accuracy and plausibility of the Michigan intercity de- 
mand model might profitably explore the following subjects: segmentation over dis- 
tance; formulation of a continuous model; development of an induced demand correction 
factor; and investigation of the variation of parameter values over time. 

Distance Segmentation—The possibility of segmenting the demand model as a function 
of distance by associating different values of 8(2) with different conductances has been 
noted already. The intercity highway traffic model designed for Michigan (4) was seg- 
mented in this fashion with good results. Such a segmentation would correct for the 
tendency of the current model to understate long-distance trips (more than 600 miles). 

Continuous Model—Certain inconsistencies in model performance might be overcome 
by developing a continous demand model having the features of the segmented model. 

A continuous model having these features is shown below. 


D = B(0)°-" re a ca aol B’ (0) wS (11) 


where S = exp [-y(F, x, )+r7]. The variables p and r are calibration constants, and the 
remaining model variables have the definitions stated in Eqs. 1, 2, and 4. SRI has 
achieved some success in calibrating the model of Eq. 11, but more experimentation 
is necessary before this model can replace the current segmented formulation. A 
similar continuous formulation could be employed to vary the parameter A(2) for dif- 
ferent intercity distances. 

Induced Demand Correction Factor—Historical data regarding induced demand should 
be gathered in an effort to estimate the value of the parameter y used in Eq. 10 to cor- 
rect for the model's tendency to overstate induced demand. 
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Time-Varying Parameters—If the functional form of the demand model is correct, 
it seems likely that parameter values will change with time. This supposition should 
be checked by calibrating the model at different points in time and attempting to ex- 
plain and quantify any differences in the calibration parameters. 


Model Application 


The true utility of the developed demand model is best tested by applying the model 
in the investigation of intercity transportation problems. In the course of SRI's 
Michigan studies, the model has been applied to the task of predicting potential air 
traffic from a proposed regional airport (8) and evaluating alternative high-speed rail 
routes between Detroit and Chicago (9). The model performed creditably in these tests. 
More such tests are needed to substantiate the model's current capability and to point 
the way toward future improvements. 
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EFFECT OF ZONE SIZE ON TRAFFIC ASSIGNMENT 
AND TRIP DISTRIBUTION 


Bruno R. Wildermuth, Wilbur Smith and Associates; 

D. J. Delaney, Transport Planning Division, Commonwealth Bureau of Roads, 
Melbourne, Australia; and 

K. E. Thompson, Bureau of Transport Economics, Canberra, Australia 


The analysis in this paper of the effect of zone size on traffic assignment 
and trip distribution was based on the results of a research study initiated 
by the Australian Commonwealth Bureau of Roads. This investigation was 
concerned with identifying criteria for zone size selection, based on the 
results of applying standard trip distribution and traffic assignment proce- 
dures to an actualroad network and vehicle trip matrix. Vehicle trips were 
ageregated to Stest systems, ranging from 40 to 263 zones, andassignments 
were made to the basic 1964 road network. The effect of different zone 
systems on traffic assignment was evaluated statistically by comparing 
assignments with those for the study area's 607-zone trip matrix and with 
1964 traffic counts at 16 check lines located throughout the studyarea. The 
effect of zonal aggregation on the simulation of trip distributions was also 
tested, both for a 1-purpose and a 4-purpose distribution. Distribution 
errors attributable to the aggregation of zones and to the grouping of trip 
purposes were evaluated for 3 zone plans by comparing assignments of the 
resulting trip matrices with the 607-zone system assignment and with the 
1964 check-line volumes. It is concluded that for many aspects of transport 
planning, zone plans with an average of 30,000 trip ends per zone will yield 
traffic assignments sufficiently accurate for predictions of traffic growth 
within transportation corridors. 


e TRAFFIC assignments for many aspects of transportation planning are not required 
to be so accurate as those required for design purposes. Rather, emphasis may be 
placed on the reliable prediction of traffic growth for travel corridors or for subareas 
of metropolitan areas. Such predictions may be desirable for evaluating a number of 
alternative urban development schemes for a range of time periods. Substantial savings 
in efforts and computer costs could be expected if estimates could be made with fewer 
analysis zones. The Commonwealth Bureau of Roads, therefore, initiated a research 
study to identify and measure the likely traffic assignment errors that would result from 
the use of fewer zones than normally used in transportation studies. 

The basic approach in the investigation was to devise a number of alternative zone 
plans and to assign the resulting rearranged trip matrix to a common road network. 
The original origin-destination survey trip matrix for all vehicle trips was used for the 
purpose. Traffic assignments were performed by utilizing 2 noniterative techniques, 
the all-or-nothing method and a multiple-route method. (The multiple-route method is 
based on the assumption that the user of a network does not know the actual link times 
he is going to encounter but that he associates with each link in the network a probable 
link time based on past travel experience. In assignments, the probable link time is 
drawn at random from a normal distribution of times for that link every time it is con- 
sidered in path building. The mean of the normal distribution is equal to the initial es- 
timate of link time, and the standard deviation of the distribution is assumed to be 18 
percent of the value of the mean.) 

An analysis was made of those trips that became intrazonal trips under different zone 
plans. Trip matrices containing only those trips were prepared and assigned to the 
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basic network to determine average vehicle-miles and vehicle-hours of travel for in- 
trazonal trips. 

Standard gravity or interactance trip distribution models were calibrated for 3 al- 
ternative zone plans. Both a 4-purpose model and a 1-purpose model were developed, 
and the resulting trip matrices were assigned to the basic road network. A unique fea- 
ture of the trip distribution model was the use of separate distribution functions for cen- 
tral business district trips, external trips, and the remaining internal trips. 

Statistical tests were made to compare assigned link volumes with traffic counts at 
16 check lines located throughout the study area. In addition, vehicle-miles and vehicle- 
hours of travel were summarized for 56 districts, identical to one of the test zone sys- 
tems. Comparisons were made with the assignment that had the smallest differences 
from the traffic counts on all links crossing the 16 check lines. Finally, the frequency 
distribution of the assigned volumes on the links of each zone system were determined. 
Particular attention was given to the links that were not assigned any trips under alter- 
native zone plans. 


STUDY DATA 


The source of data used for the research was the 1964 Melbourne Metropolitan Trans- 
portation Study, which used 607 internal traffic zones and 32 external cordon stations. 
A home interview survey of about 30,000 households (5 percent sample) provided most 
of the origin-destination data. Also surveyed were drivers of 9,000 trucks (10 percent 
sample) and nearly 600 taxis (25 percent sample). Roadside interviews at 32 external 
cordon stations provided travel data on approximately 90,000 vehicle trips. Some trips 
had both ends inside. Table 1 gives the internal and external trips by 4 trip-purpose 
categories. 

The 1964 road network coded for study purposes comprised 4,936 one-way links rep- 
resenting 1,085 miles of arterial and collector roads. The network contained 2,440 
centroid connector links and 2,519 nodes, including the 639 centroids. 

Travel time studies made in 1964 provided estimates of average daily running speeds 
for the network, Subsequently, some of the link speeds were modified to achieve better 
correspondence between actual volumes and volumes obtained from the all-or-nothing 
assignment. 


ALTERNATIVE ZONE SYSTEMS 


Various criteria, sometimes conflicting, were used in the selection of the 607 zones 
(hereafter referred to as system 607). These criteria included uniformity of land use, 
presence of physical barriers to travel, existing area subdivisions, and a requirement 
that the diameter of the zones should generally have a travel time of 3 to 6 min. For 
the purpose of statistical comparison, zone systems were defined as combinations of 
the original 607 zones. All of these criteria were observed, to the extent practicable, 
in defining the various test zone plans. 

The following ranges of internal zones were initially defined for investigation: 30 
to 50, 100 to 150, and 250 to 300 zones. The zones would have average areas of 12 to 
20, 4 to 6, and 2 to 2.5 square miles and average widths of 5, 2.5, and 1.5 miles re- 
spectively. These average values were used only as a general guide in defining zones. 
Each of the zonal systems had smaller zones near the center and larger zones in the 
outlying parts of the study area. 

The central business district and several suburban centers have trip-end densities 
far higher than those found elsewhere in the study area. These centers were retained 
as Separate zones in systems 136, 144, and 263 but necessarily were combined with 
immediately adjacent areas in the test plans with the largest zones (systems 040 and 
056). 

In addition to the reduction in the number of internal zones, external stations were 
also combined, where possible, on the basis of physical proximity. Traffic entering 
from different external stations had to use the same general travel corridors to justify 
the use of a combined external centroid. 
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The criteria selected were consistent with those generally adoptedin transport plan- 
ning; therefore, the conclusions of this study would be applicable to other cities. In 
accordance with these criteria, test systems 040, 136, and 263 were initially established. 

After the analysis of assignment results from the initial test systems, itwas concluded 
that a limit for the maximum number of trip ends per zone would substantially improve 
assignment accuracy without materially increasing the number of zones. Consequently, 
test systems 056 and 144 were established as modifications of the initial systems 040 
and 136. Accordingly, assignment errors were investigated for the range of test plans 
given in Table 2. 


ANALYSIS OF INTRAZONAL TRIPS 


The amount and percentage of intrazonal trips depend primarily on the size and type 
of land use within zones. Because intrazonal trips were not assigned to the network by 
the normal traffic assignment process, the magnitude of travel caused by these trips 
must be considered in connection with changing zone sizes. Figure 1 shows the relation 
between the percentage of intrazonal trips and the average number of trip ends per zone 
for the zone plans tested. 

To determine the magnitude ofintrazonal travelin terms of vehicle-miles and vehicle- 
hours, the intrazonal trips associated with each test system were assigned to the sys- 
tem 607 network. In this way, it was possible to isolate the link loadings resulting from 
intrazonal trips in addition to the 220,124 trips that were intrazonal for system 607. 
The all-or-nothing method was used for this purpose. (This method was selected to 
eliminate effects of the stochastic process associated with the multiple-route method. 
However, because such trips would generally travel relatively short distances, few al- 
ternative paths would be identified in a multiple-route assignment.) Table 3 gives the 
resulting vehicle travel from these assignments for the major arterial and collector 
road links (i.e., all links except centroid connectors). Table 4 gives the average and 
incremental vehicle-miles and vehicle-hours for intrazonal trips. Incremental values 
are computed from travel generated by intrazonal trips that were not intrazonal in the 
system with which the comparison is drawn. 

An investigation was also made of the concentrations of intrazonal trips. Table 5 
gives frequency distributions of 1-way links by volume prepared from the assignments 
of the intrazonal trips to the system 607 network. Depending on the number of zones, 
between 20 and 70 percent of all links will not carry any intrazonal trips. Only for the 
very coarse zone plans of systems 040 and 056 are more than 3,000 intrazonal trips 
concentrated on some links. 

It was concluded that the volumes missed in the assignment process, because of 
intrazonal trips of systems 263, 144, and 136, were not significant enough to affect 
capacity evaluations. The higher volumes of intrazonal trips assigned to some links 
in systems 056 and 040 could affect design, but even so the effects throughout a system 
planned for capacity continuity would not be great. 


COMPARISON OF ASSIGNMENTS 


The analysis and comparison of the traffic assignments resulting from alternative 
zone plans covered 2 primary areas of investigation. The first test evaluated the as- 
signments with respect to actual traffic counts. Two-way link volumes for 16 check 
lines were available for this purpose. Three of these 16 ckeck lines represented the 
major screen lines used for the verification of the original origin-destination survey 
data, and the others were located throughout the study area. The second test involved 
the comparison of vehicle-miles and vehicle-hours of travel on a district basis. The 
results from the system 607 multiple-route assignment were used as a basis for these 
comparisons. The system 607 multiple-route assignmentwas selected because it showed 
the lowest root mean square (rms) error and chi-square error for the ckeck-line com- 
parisons. 

In addition to these major tests, comparisons were made of the frequency of links 
by volume range. Some line-printer plots were also produced for visual comparison 
of assignments from the different zone systems on a link-by-link basis. 


Table 1. 1964 internal and external vehicle trips. 


Table 2. Zonal characteristics of alternative test 


systems. 
Category Internal External Total 
Automobile drivers Internal External Total 
Non-home-based 341, 693 8,824 350,517 System Zones Stations Centroids 
Home-based, work 518,578 19,443 538,021 
Home-based, other 539,579 33,922 573,501 607 607 32 639 
Commercial vehicles 631, 666 25,407 657,073 263 263 23 286 
144 144 17 161 
Total 2,031,516 87,596 2,119,112 136 436 ay ae 
056 56 17 73 
040 40 23 63 
Figure 1. Intrazonal trips in relation to average number of trip ends per zone. 
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Table 3. Vehicle-miles and vehicle-hours of intrazonal trips for 
alternative systems. 


Vehicle-Miles 


Vehicle-Hours 


Additional 

Intrazonal 
System Trips* Number 
263 139,373 104,584 
144 254,742 234,929 
136 297,504 277,471 
056 454,840 540,992 
040 594,044 793,704 


Percent of 
System 607° 


1.17 
2.63 
3.11 
6.05 
8.88 


“Intrazonal trips in addition to those already existing for systern 607. 
System 607 vehicle-miles and vehicle-hours resulting from the minimum time path all-or-nothing as- 
signment method. 


Number 


4,194 
9,009 
11,479 
20,242 
31,120 


Table 4. Average and incremental vehicle-miles and 
vehicle-hours per intrazonal trip for alternative systems. 


Incremental’ 
Average” 
Com - 
Vehicle- Vehicle- parison Vehicle- Vehicle- 
System Miles Hours System Miles Hours 
263 0.75 0.030 607 0.75 0.030 
144 0.92 0.035 263 1.13 0.042 
136 0.93 0.039 263 1,09 0.046 
056 1.19 0.045 144 1.53 0.056 
040 1.34 0.052 136 1.74 0.066 
056 1.82 0.078 


*Computed from Table 3, excluding intrazonal trips from system 607. 


® Computed for intrazonal trips that were interzonal in the comparison system cited. 


Percent of 
System 607° 


1.19 
2.56 
3.27 
5.76 
8.85 


Maximum 
Number of 
Trip Ends 
per Zone 


35, 894 
124,422 
79,176 
217,355 
124, 422 
342, 813 
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Network Preparation 

The original 1964 road network of the Melbourne Metropolitan Transportation Study 
was used for each assignment. This was accomplished by building a link data card 
image tape on which centroid connectors only were changed, or eliminated, asnecessary. 

For each zone system, one of the original system 607 zone centroids was designated 
to represent each new group of zones. All other centroids within the group were elim- 
inated. The analysis of the initial assignments for systems 263, 136, and 040 indicated, 
however, that this method was not completely satisfactory because many of the original 
zones were connected with only 1 or 2 centroid connectors. Although this method is 
quite satisfactory for a large number of small zones, it proved to be a disadvantage for 
the zone plans with fewer large zones. The lack of an adequate number of alternative 
centroid connectors caused very large volumes of trips to be assigned to links in the 
immediate vicinity of the connectors and thereby caused gross overloadings on certain 
links. Increasing the number of connectors distributes interzonal trips more evenly 
onto the network, a detail that assumes increasing importance with decreasing numbers 
of zones. Consequently, additional centroid connecting links were coded for access to 
each zone in the preparation of networks for systems 144 and 056. 


Analysis of Check-Line Crossings 


Sixteen check lines ranging in volume from 21,500 to more than 500,000 vehicles /day 
were used in this analysis. The number of 2-way network links involved ranged from 
3 for check lines 8 and 13 to 24 for check line 1, the major north-south screen line. 
The total for all check lines was 194 links. 

The rms error, a measure of the average error on individual links, and the chi- 
square error, a cumulative measure of the error terms with emphasis on the large 
errors, were selected for the analysis. 


/ n 
rms error = /(1/n) © (G - A)’ 
i= 


n 
chi-square error = )> [(G, - A,)"1/G, 


i=l 


where 


G, = ground count volume for link i, 
A, = assigned volume for link i, and 
n = number of links. 


Table 6 gives the assignment errors, measured on links crossing the 16 check lines, 
for each alternative zone system and for the 2 assignment methods. These comparisons 
show somewhat better results for the multiple-route assignment method than for the all- 
or-nothing method or the networks that were coded to allow an adequate range of alter- 
native paths to be selected. The results illustrate that overall assignment errors at 
check lines increased relatively slowly as the number of zones was decreased. Systems 
263 and 144, in which the original number of zones was reduced to less than one-half 
and one-fourth respectively, produced highly acceptable assignment results at all check 
lines. Chi-square errors for these 2 systems were 7 and 9 percent greater than for the 
lowest errors measured, which were for the multiple-route assignment to system 607. 
The error for system 144 was 6 percent greater than the lowest error, and the increase 
in error for system 263 was negligible. The rms errors measured for all 16 check-line 
crossings were nearly identical for systems 607 and 263, and the corresponding chi- 
square errors increased between 7 and 10 percent. This indicates that errors for some 
individual links were increased by the coarser zoning but that the average error re- 
mained unchanged. 
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Table 5. Frequency of 1-way links by volume for 
intrazonal trips assigned to system 607 network. 


Link-Volume System System System System System 


Range 263 144 136 056 040 
0 3,514 2,371 2,242 1,379 1,038 
1-1,000 1,408 2,529 2,546 3,207 3,059 
1,001-2,000 14 36 142 310 683 
2,001-3,000 0 0 6 32 134 
3,001-4,000 0 0 0 6 20 
4,001-5,000 0 0 0 2 2 
Total 4,936 4936 4936 4,936 4,936 


Table 6. rms and chi-square assignment errors for links 
crossing 16 check lines. 


Multiple-Route All-or-Nothing 
Assignment Assignment 
Chi-Square Chi-Square 

System rms Error Error rms Error Error 
607 7,022 591,928 7,596 648,321 
263 7,047 635,564 7,494 714,428 
144 7,454 647,452 8,940 850,789 
136 8,722 800,039 8,635 723,685 
056 9,618 1,189,126 10,553 1,327, 603 
040 12,510 2,105,617 12,255 1,895,987 
Table 7. Vehicle-miles and vehicle-hours measured Table 8. rms errors for district comparisons by 
by 2 assignment methods. multiple-route assignment. 

Multiple-Route All-or-Nothing Coefficient of 

Assignment Assignment rms Error Variation"* 

Vehicle- Vehicle- Vehicle- Vehicle- Vehicle- Vehicle- Vehicle- Vehicle- 
System Miles Hours Miles Hours System Miles Hours Miles Hours 
607 9,145,191 360,992 8,934,996 351,487 263 9,826 411 6.0 6.4 
263 9,127,628 361,396 8,912,493 351,564 144 16,244 637 10.0 9.9 
144 8,710,338 343,250 8,460,635 330,922 136 15,818 778 9.7 12.1 
136 8,965,009 357,042 8,765,627 348,187 056 22,160 879 13.6 13.6 
056 8,532,841 329,620 8,313,325 327,089 040 38,307 1,594 23.5 24.6 
040 8,895,391 352,889 8,701,668 342,589 ee 

*rms error expressed as the percentage of the mean vehicle-miles 

Note: Travel on major arterial and collector roads only, i.e., excluding (163,307) and the mean vehicle-hours (6,446) for the system 607 
centroid connectors. multiple-route assignment. 


Table 9. Cumulative difference by district of vehicle-miles 
by multiple-route assignment for system 607, 


Percentage 

Difference System System System System System 
Range 263 144 136 056 040 
= 0.00- 2.49 19 11 14 12 7 
+ 2.50- 4.99 37 24 23 21 12 
+ §.00- 7.49 44 31 28 26 20 
+ 7,50- 9.99 51 40 37 35 22 
+10.00-12.49 54 43 43 38 25 
#12.50-14.99 54 48 a7 41 32 
+15.00-19.99 55 50 50 48 35 
+20.00-24.99 56 54 53 48 41 
+25,00-29.99 56 55 53 §2 44 
+30.00-34.99 56 55 56 54 49 
+35,00-39,.99 56 55 56 54 54 
+40.00 and 


more 56 56 56 56 56 
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Analysis of Total Vehicle Travel 


From the analysis of intrazonal trips, it would be expected that reductions in the 
number of zones would be accompanied by a reduction in the assigned vehicle-miles and 
vehicle-hours of travel. Table 7 gives these travel measures for each alternative zone 
system for each of the assignment methods. 

The reduction in vehicle-miles for system 263 from those for system 607 was less 
than 0.2 percent; the vehicle-hours showed a small increase. The largest decrease in 
vehicle-miles, registered for system 056, was almost 7 percent of the total travel mile- 
age for system 607. The consistently higher amounts of travel for the multiple-route 
assignments would be expected because of the selection of alternative paths, which would 
be longer than the minimum paths. 

The loss of vehicle travel on the network generally was less than the travel lost in 
intrazonal trips for the networks that did not have sufficient centroid connectors. Sys- 
tems 056 and 144, for which additional connectors were coded, had travel losses only 
slightly larger than losses due to intrazonal trips. However, even for system 056, 
which had less than one-tenth original number of zones, the loss in vehicle-miles and 
vehicle-hours of assigned traffic was only 7 percent of the system 607 travel. For 
many aspects of transportation planning, errors of such small magnitude would be accept- 
able. 

Further analysis of the assignments was concerned primarily with identifying the 
range of assignment errors in sections of the metropolitan area. System 056 was used 
as a basis for this analysis, and each link on the arterial and collector road system was 
identified by the district in which it was located. Vehicle-miles and vehicle-hours of 
travel from the multiple-route assignments for each test system were summarized, by 
each of the 56 districts, and compared with the travel summaries from the multiple- 
route assignment for system 607. The resulting rms errors are given in Table 8. 

Table 9 gives the cumulative frequency of districts by percentage difference of 
vehicle-miles for the alternative zoning system. Vehicle-miles for system 263, for 
instance, differed from those of system 607 by 10 percent or more in only 5 districts. 
In only 2 of these 5 districts was the difference more than 12 percent. Thus, in 95 per- 
cent of the districts, assigned travel for system 263 would be in error by no more than 
12 percent. For systems 144 and 136, however, an error larger than 12 percentoccurs 
in 13 districts, or 23 percent of all districts. 

A typical spatial distribution of percentage differences in vehicle-miles of travel by 
district is shown in Figure 2. Visual inspections of such distributions for all systems 
indicated that the spatial pattern of errors did not seem to be systematic from one test 
system to the next. Only 6 districts (2, 8, 9, 30, 41, and 46) were found to have an 
error greater than 5 pecent in all test systems. District 33 was the only district that 
had less than 5 percent error in each test plan. 


Comparisons of Assignments by Link Volumes 


Analyses were also made of frequency distribution of link volumes. Of particular 
interest was the number of links that were assigned either no volumes or very low vol- 
umes. The number of such links would be expected to increase as the number of zones 
decreased. Analysis of the frequency tabulations indicated that with the multiple-path 
assignment fewer links were unused. In fact, the number of links with 0 volumes was 
generally half that for a minimum time path assignment. Because only the major arte- 
rial and collector roads were represented by the network, the multiple-path assignment 
was more realistic in this aspect. 

Figure 3 shows that a consistent relation existed among the number of low-volume 
links (0 to 2,000) the number of zones used, and the assignment technique applied. 

Links with excessively high volumes could also constitute a problem in zone aggre- 
gation. However, most of the extremely high-volume links were found to be located 
immediately in the vicinity of centroid connections. With the limitation of the maximum 
number of trip ends and the provision of adequate alternative connections to centroids, 
as used for systems 056 and 144, these extremes did not occur. 


Figure 2. Differences in vehicle-miles by multiple-route 
assignment for systems 263 and 607. 
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Figure 3. Low-volume lines and 0-volume lines for 2 assignment methods related to average 
number of trip ends per zone. 
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ASSIGNMENT TEST FINDINGS 


The investigation established measures of traffic assignment errors resulting from 
tests of widely varying zone plans. Also analyzed were the percentage of intrazonal 
trips and the corresponding magnitude of travel lost in zone aggregation. 

The amount of travel lost because of intrazonal trips was relatively small, even 
though these trips may constitute a sizable percentage of total trips. Intrazonal trips 
lost ranged from 17.0 percent for system 263 to 38.6 percent for system 040. However, 
these trips accounted for only 1.2 and 8.9 percent respectively of assigned vehicle-miles 
of travel. Such losses could easily be tolerated for many planning purposes, especially 
because the study found that these trips are not concentrated to any significant degree. 

Comparisons of assigned volumes with traffic counts at 16 check lines indicated that 
the number of zones could be reduced to less than half of the original number before 
increases in rms errors occurred. The multiple-route assignment produced lower er- 
rors than the all-or-nothing assignment. 

Within sections of metropolitan Melbourne, the comparison of alternative assignments 
could be made only against the system 607 multiple-route assignment that, of course, 
was affected by some errors of its own. rms errors for 56 districts for multiple-route 
assignment to systems 263, 144, and 136 were within 10 percent of the average vehicle- 
miles and 12 percent of the average vehicle-hours for system 607. All-or-nothing as- 
signment increased these errors by between 1 and 2 percent for vehicle-miles and up to 
3 percent for vehicle-hours. 

The analyses undertaken in this study indicated that the number of zones commonly 
selected for metropolitan transportation studies may be reduced to a third or even a 
fourth and still produce traffic assignments with adequate reliability for many planning 
purposes. It was further shown that additional traffic assignment errors can be mini- 
mized by the use of multiple-route assignment, the provision of adequate centroid con- 
nections, and reasonable limitations on the maximum number of trip ends concentrated 
within any one analysis zone. 


ANALYSIS OF TRIPS BY PURPOSE 


In the transportation planning process, trips are generally categorized by a number 
of different purposes, and trips made by different vehicles, such as passenger cars, 
trucks, and taxis are usually treated separately. There are valid reasons for this sep- 
aration, including differences in trip length, peak-hour concentration, and spatial dis- 
tribution. However, each additional category increases the cost of data preparation and 
computer processing. Thus, substantial economies would result if categories could be 
combined for the purpose of trip distribution. 

The study investigated the effect of reducing the number of purposes normally used 
in a transportation study and the effect of zone aggregation on trip distributions. A 4- 
purpose model and a 1-purpose trip distribution model were considered, each being cal- 
ibrated for 3 different zone plans: systems 263, 144, and 056. 

The 1-purpose model was calibrated against the all-vehicle trip matrix used for the 
assignment test phase of the study. The 4-purpose model consisted of 3 categories of 
automobile driver trips (non-home-based, home-based work, and home-based other) 
and 1 category of commercial vehicle trips. Home-based trips were arranged so that 
trip production was always from the same zone as the home of the driver. Origins and 
destinations of non-home-based and commercial vehicle trips were assumed to corres- 
pond to production and attraction zones respectively. 


Trip Length Distributions 

Trips of different purposes and vehicle types are generally found to have significant 
variations in trip length characteristics. Average trip lengths for the categories of 
trips used in the study are given in Table 10. Average trip lengths were affected to 
some degree by reductions in the number of zones because of different locations of zone 
centroids and changes in the proportion of intrazonal trips. 
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Stratification of Trips by Area 


Early in the calibration process it was found that, despite excellent overall corre- 
spondence between observed and estimated trip-length frequency distributions, large 
differences in trip-length characteristics existed for trips associated with the CBD and 
with external stations. Accordingly, the trip matrices were partitioned for separate 
analyses of each of these 2 classes of trips and for all other trips. 

It was found that the percentage of CBD trips was highest (15.4 percent) for non-home- 
based trips and lowest for home-based other trips (4.9 percent). For external trips, 
the order was reversed with 5.6 percent for home-based other trips and 2.3 percent for 
non-home-based trips. For all trips, the distribution included 9.1 percent CBD trips 
and 4.0 percent external trips. Of all CBD trips, 37 percent were made by commercial 
vehicles. 

Average trip lengths for trips of the selected spatial classes are given in Table 11. 
For all categories of trips, CBD and external trips were substantially longer than in- 
ternal non-CBD trips. Thus, although these 2 trip classes constituted only a small per- 
centage of all trips, they accounted for a substantially larger portion of the total vehicle- 
hours of travel, ranging from 17 percent for home-based work trips to 25 percent for 
non-home-based and commercial vehicle trips. If all trips are processed in 1 group, 
the trip distribution does not adequately reflect the length and special orientation of 
these trips. However, with the calibration of separate distribution functions for each 
of the 3 spatial classes, as was done in the study, a satisfactory trip distribution was 
achieved. 


Intrazonal Trips 


The number of intrazonal trips increased substantially as the number of zones was 
reduced. Figure 4 shows the variation in the percentage of intrazonal trips of different 
trip categories plotted against average trip densities as obtained from the analyses of 
systems 263, 144, and 056. The proportion of intrazonal trips is best for home-based 
work trips, ranging from 3.3 percent for system 607 to 10.3 percent for system 056. 
Home-based other trips and commercial vehicle trips have the largest proportion of 
intrazonal trips, ranging from 13.3 and 14.0 percent to 30.5 and 26.1 percent respec- 
tively. 


TRIP DISTRIBUTION MODEL CALIBRATIONS 


Trip distribution models, using the 3 area stratifications, were calibrated for sys- 
tems 263, 144, and 056 for 5 categories of trip. Four calibration cycles were applied 
in each case to determine the proper impedance function. No particular difficulties 
were experienced with the calibration of any of the distribution models. The average 
estimated trip length for each completed calibration differed from the observed trip 
length by no more than 0.1 min, the largest discrepancy between values of the cumu- 
lative trip-length frequencies being approximately 1 percent. 

The differences between the impedance functions for trips associated with the spatial 
classes are shown in Figure 5 for the all-vehicle category and system 144. For long 
trips, impedance to travel tends to be greater for internal trips than for CBD or exter- 
nal trips. This same relation was observed for each of the categories considered in 
the study. 

Variations among impedance functions for each of the 3 test systems are shown in 
Figure 6 for the all-vehicle category of trips. The curves were markedly similar, es- 
pecially for the middle range of travel times into which most of the trips fall: Fewer 
than 2 percent of the trips were longer than 50 min, the value beyond which the curves 
diverge. At the other extreme, the variation among impedance values for the very short 
trips would be expected because of the differing numbers of intrazonal trips. However, 
for most of the range, the curves followed one another closely and demonstrated that 
the impedance function described trip-maker characteristics that, of course, were in- 
dependent, for this range, of the number of zones into which the study area was divided. 
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Table 10. Average trip length in min by category. 


Category 


Automobile drivers 
Non-home~-based 
Home-based, work 
Home-based, other 


Commercial vehicles 


Total 


System 
263 


15.4 
19.6 
13.5 
15.0 


15.8 


System System 


144 


14.9 
19.0 
13.0 
14.7 


15.4 


056 


14.6 
19.0 
13.4 
14.4 


15.4 


Table 11. Average trip length in min by category and 
spatial class for system 263. 


Category 


Automobile drivers 
Non-home-based 
Home-based, work 
Home-based, other 


Commerical vehicles 


Total 


Figure 4. Intrazonal trips by category related to average number of trip ends per zone. 


CBD 


21.2 
29.3 
29.5 
18.7 


23.4 


Internal External 


14.0 
18.5 
11.8 
14.0 


14.5 


Total 
24.9 14.3 
23.9 19.6 
27.1 13,5 
26.8 15.0 
26.1 15.8 
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Figure 5. Impedance functions by spatial class for all vehicles and system 144. 
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Figure 6. Impedance functions by system for all vehicles and internal trips. 
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Of the 3 impedance functions shown in Figure 6, the curves for systems 144 and 263 
showed the most striking resemblance. Although the system 056 curve is similar, it 
seems that the drastic reduction in number of zones caused some loss of sensitivity. 
This tends to suggest that system 056 partitioned the study area into too few zones. 


ANALYSIS OF TRIP DISTRIBUTION ASSIGNMENTS 


The statistical evaluation and comparison of the 1-purpose and 4-purpose distribu- 
tion models for the 3 test systems were carried out in the same way as the investiga- 
tion of the effects of zone aggregation on assignment. Two assignments were prepared 
for each of the 3 zone systems: one with the trip matrix from the 1-purpose modeland 
one with the all-vehicle trip matrix resulting from the sum of the 4-purpose trip tables. 
All assignments were made by using the multiple-route technique. The following anal- 
yses were made: 


1. Two-way link volumes were compared with traffic counts for the 16 check lines; 

2. Vehicle-miles and vehicle-hours of travel were accumulated for 56 districts and 
compared with the basic results from the system 607 multiple-route assignment of the 
O-D survey trip matrix; and 

3. The frequencies of links by volume ranges were compared. 


The results obtained for both the 1-purpose and the 4-purpose trip distributions were 
compared with the corresponding zone plan assignments of the original O-D survey trip 
matrix. This was necessary to isolate the errors of the trip distribution process from 
those inherently associated with the zone plan and the O-D survey data used for calibra- 
tion. 


Check-Line Comparison 


rms errors and chi-square errors resulting from the 1-purpose and the 4-purpose 
trip distribution assignments are given for each test plan in Table 12. The results of 
both trip distribution assignments showed very close agreement with the O-D assignment 
for all zone systems tested. Both rms and chi-square errors were only slightly smaller 
for the 4-purpose trip distribution model than for the 1-purpose model. Increases in 
errors relative to the corresponding O-D assignment error decreased with reductions 
in the number of zones. 

The check-line comparison indicated that reductions in the number of zones tended 
to reduce errors resulting from the trip distribution process taken alone. Further- 
more, the elimination of separate-purpose and vehicle-type categories did not signifi- 
cantly affect the assignment results at any level of zone aggregation. 


District Comparison 


The reliability of assigned travel throughout the network was assessed by accumulat- 
ing traffic volumes from the assignments of 1-purpose and 4-purpose trip distributions 
for each of the 56 districts and by comparing them with the volumes from the multiple- 
route assignment for system 607. Table 13 gives the total vehicle-miles and vehicle- 
hours for the 3 test systems. Again, only marginal differences occurred between the 
1-purpose model and the 4-purpose model. Both models produced total travel estimates 
within 3 percent of the estimates obtained from the assignment of the O-D trip matrix. 
Table 14 gives the comparisons for rms errors of vehicle-miles and vehicle-hours re- 
spectively. The errors were greater when comparison was made by district. Table 15 
gives the cumulative frequency of districts by percentage difference of vehicle-miles. 
These data suggest that the 4-purpose distribution provided more added accuracy than 
was indicated by the check-line comparison. 


Link Volume Distribution 


The analysis of individual link volumes for 1-purpose and 4-purpose distributions in- 
dicated only small differences for most volume ranges. Trip distribution assignments 
produced fewer low-volume links (0 to 2,000 volume range) than the O-D survey assign- 


72 


Table 12. rms and chi-square assignment errors for links crossing 16 
check lines. 


4-Purpose 1-Purpose 
O-D Survey Data Distribution Distribution 
rms Chi-Square rms Chi-Square rms Chi-Square 
System Error Error Error Error Error Error 
263 7,047 635, 564 7,631 774,738 7,780 785,465 
(8.3) (21.9) (10.4) (23.5) 
144 7,454 647,452 7,896 741,073 7,967 748, 603 
(5.9) (14.5) (6.9) (15.6) 
056 9,618 1,189,126 9,801 1,228,486 9,983 1,263,340 
(1.9) (3.3) (3.8) (6.2) 


Note: Figures in parentheses indicate percentage increase in errors over O-D assignment errors, 


Table 13. Vehicle-miles and vehicle-hours by distribution assignment for 
3 test systems. 


Vehicle-Miles (thousands) Vehicle-Hours (thousands) 
System O-D 4-Purpose 1-Purpose o-D 4-Purpose 1-Purpose 
263 9,128 9,357 9,362 361 367 365 
144 8,710 8,955 8,896 343 349 346 
056 8,533 8,787 8,745 330 345 342 


Table 14. rms errors for district comparison of vehicle-miles and 
vehicle-hours by distribution assignment for 3 test systems. 


rms Error Coefficient of Variation 


System O-D 4-Purpose 1-Purpose O-D 4-Purpose 1-Purpose 


Vehicle-Miles 


263 9,826 18,141 22,179 6.0 11,1 13.6 
144 16,244 19,348 22,412 10.0 11.8 13.7 
056 22,160 27,551 31,601 13.6 16.9 19.4 
Vehicle-Hours 

263 411 692 839 6.4 10.8 13.0 
144 637 779 903 9.9 12,1 14.0 
056 879 1,021 1,181 13.6 15.8 18.3 


Table 15. Cumulative difference by district of vehicle-miles by distribution 
assignment for 3 test systems. 


Percentage System 263 System 144 System 056 

Difference > FO 
Range 1-Purpose 4-Purpose 1-Purpose 4-Purpose 1-Purpose 4-Purpose 
+ 0.00- 2.49 ig 12 9 ll 5 8 

+ 2.50- 4.99 19 23 15 21 14 17 

+ 5,00- 7.49 29 31 26 28 20 22 

= 7.50- 9,99 37 39 33 33 26 29 
+10.00-12,49 39 46 35 38 32 37 
£12,50-14.99 43 49 40 45 36 39 
£15.00-19,99 50 53 47 50 44 46 
+20.00-24,99 53 55 52 53 47 49 
+25.00-29,99 54 55 53 54 50 50 
+30.00-34.99 55 55 54 54 51 51 
+35.00-39.99 55 56 54 54 52 53 

+40.00 and 


more 56 56 56 56 56 56 
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ment. This was to be expected because the trip distribution process generated trips 
for many cells of the matrix where the trip frequency is too small to get complete cov- 
erage in an O-D sample survey. There were also improvements relative to the O-D 
assignment for links with volumes of more than 30,000. 

Figures 7 and 8 show assignments resulting from the 1-purpose and the 4-purpose 
distributions for systems 263 and 144. The line-printer graphs shown in these figures 
illustrate the high degree of correspondence between the 2 distributions for all link- 
volume ranges. 


TRIP DISTRIBUTION TEST FINDINGS 


The study established that trip distribution models can readily be calibrated for anal- 
ysis systems with widely varying numbers of zones. One-purpose and 4-purpose models 
were developed that reproduced the observed trip patterns within acceptable limits. It 
was found that the friction or impedance functions, which reflected the bahavior of trip- 
makers with regard to travel time, were not affected by zone aggregation, except for 
the intervals that applied to intrazonal trips. Even so, these trips did not cause any 
particular problem in the calibration process, provided that the intrazonal travel times 
for large zones were carefully determined. Even a small change in intrazonal times 
(e.g., 1 min) can have a significant effect on the number of trips leaving and entering a 
zone when 50,000 or more trip ends need to be distributed. 

The investigation evaluated assignment results from 1-purpose and 4-purpose trip 
distributions. It found that the rms errors resulting from both trip distributions for 
links crossing 16 check lines were only marginally larger than the errors from the 
assignment of O-D survey data. The errors ranged from 2 to 10 percent, the smallest 
errors being associated with the least detailed zone plan and the largest errors with the 
most detailed zone plan. The errors of 1-purpose distribution assignments were con- 
sistently larger than those of 4-purpose assignments for each test system; however, 
the increase in error was much less than the increase in error over the O-D assign- 
ments. 

The analysis of vehicle-miles and vehicle-hours traveled in 56 districts indicated 
similar results although, with the exception of system 263, the increase in errors from 
the 4-purpose distribution to the 1-purpose distribution was similar to the increase in 
assignment errors between the O-D survey and the 4-purpose trip distribution. Close 
analysis indicated, however, that most large errors were due to the effects of the zone 
plan rather than the trip distribution process. 

In summary, the findings indicated that reductions in the number of zones did not 
appreciably diminish the accuracy of trip distributions. Further, 1-purpose trip dis- 
tributions were only very slightly less accurate than separate distributions of several 
trip purposes. This finding is of major importance because of the substantial reduc- 
tions in computer time and analysis effort that would result from the use of a 1-purpose 
distribution model. 


CONCLUSIONS 


The study has determined the levels of accuracy in traffic assignments and trip dis- 
tributions that may be expected from the use of fewer analysis zones than normally used 
in conventional transportation studies. 

Assignment tests with a total vehicle O-D survey trip matrix have identified the level 
of accuracy in traffic assignments for 5 widely varying zone plans, thus providing evi- 
dence to transport planners for the selection of zone numbers according to the objectives 
of their studies. For major route-planning purposes, adequate traffic assignments 
should be obtained from zone plans with an average of 10,000 to 15,000 trip ends per 
zone. For predictions of traffic growth within transportation corridors or segments of 
urban areas, zone plans with as many as 30,000 trip ends per zone should yield traffic 
assignments with sufficient accuracy. 

The comparison of O-D traffic assignments with trip distribution output matrices 
indicated that the accuracy of the trip distribution process was not affected significantly 
by reductions in the number of analysis zones. Thus, decisions regarding the design 
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Figure 7. Comparison of 4-purpose and 1-purpose distribution 
assignments for system 263. 
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Figure 8. Comparison of 4-purpose and 1-purpose distribution 
assignments for system 144. 
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of zone systems should be based primarily on the level of acceptable accuracy for the 
traffic assignment phase alone. For the trip distribution phase of a study, the use of 
a 1-purpose trip distribution with partition of the matrix offers significant savings in 
computer processing and leads to little reduction in accuracy compared to the usual 
distribution of several trip purposes. 

The data on traffic assignment errors presented in this study should serve as a guide 
for the selection of optimal zone size for transportation planning studies. The tech- 
niques developed are appropriate for testing the potential for aggregating the initial zone 
system of completed studies for use in continuing planning. Generally, zone aggrega- 
tion offers significant reductions in the effort required to prepare input data and to ana- 
lyze transportation study results. Particularly important savings may also be achieved 
in developing projections of land use activities for trip estimation. This could be wel- 
comed by land planners, who prefer to work with larger zones than are normally used 
in transportation studies. Substantial savings can also be achieved in computer time 
for processing trip distributions and assignments. 

We conclude from this study that for many aspects of transport planning, where ac- 
curacy at the level of individual arterial routes is not essential, standard traffic assign- 
ment and trip distribution methods can reliably be applied with substantially fewer zones 
and fewer trip purposes than used in conventional transportation studies. 

In view of the potential savings resulting from the use of fewer zones, further re- 
search should be undertaken to investigate the effects of zone size on other aspects of 
the transportation planning process. For new or repeat studies, consideration should 
be given to the effect on the accuracy of trip-end estimations. Further work is also 
required to assess the effects with capacity-restraint assignments and with peak-hour 
assignments. Changes in the normal approach to assignment may prove feasible, so 
that individual route accuracy can be achieved while the advantages of large zones are 
retained. For example, intrazonal trips could be allocated uniformly to links within a 
zone as part of the assignment process. More fundamental innovations in traffic assign- 
ment may follow if it is appreciated that large numbers of zones are not necessarily 
basic to achieving acceptable levels of accuracy. This could open up new horizons for 
testing alternatives, for stage development planning, for economic evaluation, and for 
properly integrating road and public transport planning, if the present transport planning 
process is made less cumbersome, less time-consuming, and less costly. 


DEVELOPMENT AND APPLICATION OF 
DIRECT TRAFFIC ESTIMATION METHOD 


Yehuda Gur, Department of Systems Engineering, University of Illinois at 
Chicago Circle, and Chicago Area Transportation Study 


The paper describes the following major problems encountered in the pro- 
cess of programming and testing the direct traffic estimation method: how 
to estimate contributions to volumes of individual zones, how to deal with 
assymetric networks, how domain overlap occurs and can be identified, 
and how to improve results by smoothing the boundaries between main and 
prime domains. The major conclusions of the first-stage testing of the 
model are as follows: (a) The model produces reliable expressway volume 
estimates that may be used for planning and analysis; (b) currently, it is not 
recommended that the model be used for estimating arterial volumes, and, 
in case sucha use is made, the range of possible errors should be kept in 
mind; and (c) application of the model gives an excellent method for describ- 
ing the probable sources of traffic on a given link and the function of the 
link in serving the travel demand as a part of the network. Visual repre- 
sentation of the results using the available auxiliary programs is of 
special help. 


eWITHIN the framework of urban transportation planning, estimates of traffic volumes 
on major arteries have special importance. They are essential inputs for both planning 
and design and for evaluation of alternative plans and programs. Traditionally, link 
volumes on the highway network are estimated through the chain of land use-trip 
generation-conversion-distribution-assignment (4). This method is aimed at and is 
relatively efficient for estimating link volumes for the whole region or large parts of it. 
However, where volumes estimates are required for a small number of links, running 
the full assignment is quite inefficient. 

In cases where information on sources of traffic on particular links is required, the 
same problem occurs. The assignment package can supply this information but only 
through a tedious and expensive process. The direct traffic estimation method (DTEM) 
is efficient in the analysis of separate links, and can be used where such an analysis is 
needed, 

This method was developed by Schneider in 1965 (1). It was computer programmed 
and applied by the Tri-State Transportation Commission (TSTC) in 1967 (2). In this 
method, link volumes are estimated by examination of the link itself, not the interchange 
it serves. The link volume is estimated as a function of the potential of the region to 
generate traffic on the link and the availability of parallel links to serve this traffic. 
The method also gives a measure of the relative importance of different zones in the 
region as sources for traffic on the link. Analysis of each link is done completely in- 
dependently of analysis of other links. 

After careful examination of the program being used by TSTC, it was found that the 
use of this program by the Chicago Area Transportation Study (CATS) would require 
major changes either in the CATS data system or in the program. This was due to dif- 
ferences in the geographical identification system and network coding rules used in the 
2 studies. Since DTEM was intended to be used by CATS, together with the existing 
assignment program, it was decided to reprogram DTEM so that it would accept the 
existing CATS assignment input tapes. 

A program was written in SIMSCRIPT for the CDC 6000 series computers. (Cur- 
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rently, a FORTRAN version of the program is being written.) Current size limitations 
are 19,999 nodes maximum, 9,999 zones maximum, and 


CM = 12,000 + N* {(2 + (C/2)]} + 22 


where 
CM = central memory size (words) (50,000 for the CDC 6400 and 130,000 for the 
CDC 6600), 
N = number of nodes, 
Z = number of zones, and 
C = maximum number of links connected to 1 node. 


Together with the main program, a system of auxiliary programs, mainly in FORTRAN, 
was prepared. These programs enable efficient calibration, preparation of input, print- 
ing of summaries, and plotting of output. A users’ and reference manual was also 
prepared for these programs. 


THE THEORY 


The theory of the direct traffic estimation method was developed in another paper 
(1). (In the Appendix some theoretical difficulties are discussed, and modifications to 
the original theory are suggested.) Here it is intended to give only the logic of the 
method, definition of terms, basic assumptions, and major results of the theory. This 
section also describes a number of problems in application of the theory and the way 
these problems are dealt with. 


Terminology 
Figures 1 and 2 show graphical description of some of the following terms: 


1. Point of interest (POI). A point on the road network (not an intersection) whose 
volume is being estimated (Fig. 1). 

2. Link of interest (LOI). The link on which the POI is located. For convenience 
in reference, the direction of the link is designated as north-south. 

3. Main nodes. The nodes at the ends of the link of interest. 

4. Turning links. The links connected to the main nodes, excluding the main link. 
The turning links end with the turning nodes. 

5. Path value (F,,). The generalized cost of travel between 2 points, i andj. Itis 
defined as the value of the minimum path between the points on the coded network. 

6. Main boundary. A line through the POI, approximately perpendicular to the link 
of interest, that divides the region into north and south domains. The path value from 
the POI to any point on the main boundary is the same going north or south. 

7. Domain (d). A definite area within the region (Fig. 2). 

8. Main path. The minimum path from the POI (to a point). 

9. Prime path. The minimum path from the main boundary (to a point). 

10. Main domains (n,s). The north domain (n) is part of the region that is most 
easily reached from the POI going north. (That is, the best path from the POI to any 
point in the north domain passes through the main node.) Similarly, the south main 
domain is defined. Hence, the main boundary may be defined as the indifference line 
between the northbound and the southbound main paths (Fig. 2). 

11. Prime domains (n’, s’). The prime domains are parts of the main domains 
(n, s) for which the main path is also the prime path. 

12. Decay function G(:), A decreasing function of the friction between 2 points. 

13. Domain integral (Id). 


Id = ¥ G(F,,)x V, (1) 


jed 
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where 


Fs = path value between zone j and POI, and 
v, = number of trip ends at zone j. 


The summation is done over the finite area d (a zone, subregion, or region). 


Assumptions 


1. Complete symmetry exists in volumes and path values throughout the systems; 
i.e., “= =Q,,, where Q, is 1-way volume between i and j, and a, = + By where F is 
directiénal path value bétween POI and zone i. 

2, Each trip end sends and receives exactly 1 trip daily. 

3. A northbound vehicle at the POI has its origin in the south domain and its destina- 
tion in the north domain. 


(In all statements following, northbound can be replaced by southbound by replacing 
n by Ss, Ss by n, n’ by s4, and so on.) 


4, A northbound trip originating in s’ is defined as free. Its destination may be any- 
where in the north. 

5. A northbound trip originating out of s’ has by definition a prime path better than 
the main path. If it uses the link of interest, it implies that its destination is in n’. 
Such trips are called fixed. 

6. The relative possibilities of trips through the POI terminating in different domains 
are proportional to the domain integrals. That is, 


P {D= j|Ded, jed} = 1/1, (2) 
where D is the destination of the trip. 


Calculation of Volumes 


Based on the previous assumptions, the daily 2-way volume through the POI has been 
found to be 


(ly ‘f * It )/G, +1,) (3) 


Partial Volumes 


Equation—Let d be a subdomain of the north domain, so that den, d’ en’, d’ed, and 
d’>@¢. The total 2-way volume from this domain through the POI is 


= 0, x y+ at)/C +1) (4) 


If we sum up the partial volumes to all the subdomains in the north domain, we get 
=(UY/@+1)lel-h+ hi) (5) 


This equation is used to calculate turning volumes and the contribution of each zone in 
the link volume. 

Turning Movements—Clearly, the path from any point in the region to the POI passes 
through exactly I turning link (as defined earlier). Hence, the subdomains (parts of the 
prime and the main domains) that belong to each turning link may be found, and the turn- 
ing movement calcuated by using Eq. 5. 

Volumes for Individual Zones—Equation 4 may be used to find 2-way volumes from 
individual zones through the POI. As usual, a zone is represented on the network as a 
point (load node) with a given number of trip ends on it. In this case, the complete 
zone belongs or does not belong to a prime domain. This might cause some inaccuracies 


79 


in the estimation of the distribution of individual zones to the links volume. In order 
to partly overcome this problem, the program enables a breakdown of zones between 
the prime and nonprime domains. In this case, a zone may belong partly to the prime 
domain. Equation 4 covers the 2 cases. 


Asymmetric Networks 


Clearly, the assumption of symmetry of volumes and path values should be approxi- 
mately true in order to allow successful application of this method. This is especially 
important near the POI. Hence, estimation of directional peak volumes or of volumes 
on 1-way links in the network is impossible. On the other hand, inclusion of 1-way links 
in the network is allowed as long as they do not cause substantially different path values 
to and from the POI. 


Direction of Paths 


Theoretically, the path values between any 2 points are the same in both directions. 
Unfortunately, 1-way links do exist and cause a number of small problems that are de- 
scribed below. 

One-Way Turning Links—The main tree is built from the POI outward (Fig. 3). There- 
fore, if a turning link is 1 way inward (case 1), the calculated turning volume on this 
link will always be 0. If a turning link is 1 way outward (case 2), then its calculated 
volume may be positive. 

One-Way Boundary Links—The main boundary is defined by a building a tree from 
the POI. The main domain of each node is determined according to the main node it 
is connected to. For the 2-way boundary link A-B, the following relation always holds: 


| Fs ii F., | <T,, (6) 
where F, is the path value of I from POI, and T,, is the link friction. If the boundary 
link is 1° way, this is not always true. In this case, it may happen that the boundary 
passes around a 1-way boundary link. 


Sensitivity of Volume to Path Values 


Errors in path values from the POI to different zones appear due to errors in coding 
the network or in link friction values. These errors may be classified as either sys- 
tematic or random errors. They may affect the estimated volume by changing the do- 
main integral values of zones or by transferring zones between domains. 

Random Errors—In the case of random errors in link friction values, the errors due 
to changing zone values are generally very small. The errors due to transferring zones 
may be quite large. These errors usually consist of a transfer of a zone in or out of 
the prime domains. This happens quite often, because generally the path value from 
the POI to a zone is never much smaller than the best alternative path from the main 
boundary. A small increase in the "main path" value may easily cause a zone to leave 
the prime domain. Errors due to transfers between domains are especially serious 
when only a few zones are included in the prime domain (in this case, the inclusion of 
any zone makes a large percentage difference in the calculated volume) and when one 
of the main domains is much larger than the other (a common case when the link is 
located near region boundaries). This problem is overcome by breaking zones between 
main and nonmain domains as follows: 


0 ifZ" <0, 
Z’ et Z i£0' <2 <1 (7) 
fT Zs Se 
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where 
Z"=0.5+k [(F - F,)/F,], 
Z’ = percentage of the zone (i.e., the trip ends in the zone) that belongs to the prime 
domain, 


F, = path value on the prime tree, 
F. = path value on the main tree (from the POI), and 


k = specified parameter (currently used k = 0.1/0.2). 


By this method, errors in volume due to misspecification of domains still cause con- 
siderable error, but much smaller than before. In the computer program, this formula 
may be applied through a special optional routine. Experimenting with this method 
shows substantial improvements in estimation of volumes on minor links with almost 
no effect on high-volume links. The problem of 0-volume links (due to nonexistence of 
prime domain) vanished. 

Systematic Errors—Systematic errors in the path values cause changes of domain 
integral values. The danger of such errors exists especially when the network used 
for calibration is substantially different from the network used for estimation. This 
problem is discussed later. 


Domain Overlap 


Sometimes, due to peculiarities in networks, a certain area belongs by definition to 
the prime domains of 2 parallel adjacent links. This is called domain overlap. An 
example of such a condition is shown in Figure 4. There, area A belongs to the prime 
domains of the 2 points of interest I and II. Clearly, the existence of domain overlap 
is in contradiction with the logic of the theory. It causes overestimation of volumes 
on the links involved. 

The existence and the extent of domain overlap in the analysis of any specific link 
are practically impossible to predict. They may be checked by plotting the domain 
boundaries of adjacent links. Generally speaking, areas with irregular networks are 
more prone to have difficult overlap problems. (Thus, in the Chicago area, which has 
relatively uniform network configuration, the problem is not especially serious.) 

The domain overlap problem may be solved by merging 2 links into 1, estimating the 
joint volume, and then externally or otherwise redistributing the volume. A detailed 
study and implementation of correction procedures for this problem are yet to take 
place. 


Concluding Remarks 

The direct traffic estimation method uses practically the same inputs as the tradi- 
tional method: trip ends, decay function, and coded network. Yet, the 2 methods are 
significantly different in the ways used to analyze the data. The DTEM is advantageous 
in solving individual links. But, it has a number of disadvantages. The more important 
are the unavailability of capacity constraints and the inability to apply the method for 
assymetric loadings. Only careful and continual use of the method will enable its prac- 
ticality as a planning tool to be evaluated and its most effective place within the set of 
available travel demand prediction models to be defined. 


CALIBRATION AND TESTING 


DTEM was calibrated and tested with CATS 1965 data. The major findings are as 
follows: 


1. The negative exponential function (Eq. 8) may be used as the decay function. 


G(F,,) = exp (-8« F,,) (8) 
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where 


Fo; = travel friction between the point of interest and zone j, 
B = coefficient, to be determined by the calibration; and 
G(‘) = decay function. 


2. The optimal f for estimation are A(1) = 0.06 f ‘for expressways, and B(2) = 
0.08 f ' for arterials, where f is the CATS friction measure. (A unit friction represents 
approximately 24 sec on a free network.) 

3. Expressway estimated volumes are close to counted volumes with a standard error 
of less than 12 percent of the mean. 

4, Estimated arterial volumes have a standard error of 47 percent of the mean. The 
error is correlated negatively with the counted volume and number of lanes. 


Method and Analysis 


Ideally, the model may be calibrated by analyzing the distribution origins of vehicles 
traveling on the link of interest. Comparing the actual distribution with the distribution 
implied by the model (as calculated by Eq. 4) allows both the form of decay function and 
its parameters to be found and checked. Such an analysis may increase substantially 
our understanding of urban travel and the performance of the model. This analysis re- 
quires data on origins of trips using the link. These data are available only through a 
roadside interview. Unfortunately, such data were not readily available for the present 
analysis (and are practically unavailable for expressways because of technical difficul- 
ties in collection.) 

In the calibration procedure used here, the decay function was chosen externally. 
Volumes on sample links were calculated by using different coefficients in the function. 
The optimal coefficients were chosen so as to minimize the sum of deviations between 
the estimated and the counted volumes. Later, the volumes estimated by using the 
chosen coefficients were analyzed to find the level of accuracy and the error properties 
of the estimates. 


The Data 


The model was calibrated and tested with CATS 1965 data. These data include an 
updated coded highway network, estimated number of zonal trip ends, and counted vol- 
umes on major roads. The volume counts on this set of data were carefully prepared 
and intensively checked. A sample of 11 expressway links and 22 arterial links was 
chosen for analysis. The sample covers a wide range of locations, geometry, and link 
volumes. 


The Decay Function 


Two decay functions were tested for use: the A function, developed by Schneider (3), 
and the negative exponential function. The two were shown earlier to fit DTEM (1, 3). 
In preliminary runs, no advantage of either of the 2 functions was apparent. Hence, it 
was decided to use the simpler one, i.e., the negative exponential function (Eq. 8). 

It should be remembered that it is possible that other functions may prove to be 
better. Until more analysis is done (preferably by using extensive data from roadside 
interviews), no final conclusion can be made. However, it should be noted that the nega- 
tive exponential is the only decay function that is completely consistent with the theory 
(see Appendix). 


Analysis of Results 


The basic theory presumes that 1 decay function may be used for all route types. 
However, it was found both by CATS (Figs. 5 and 6) and by TSTC (2) that the use of 1 
decay function causes underestimation of expressway volumes or overestimation of 
arterial volumes or both. Use of 1 decay function for the 2 cases would require manual 
adjustment of results. Such adjustments may be avoided, or at least decreased, by a 
separate analysis of the 2 link types. Such a separation was done in this analysis. 


Figure 1. Part of region around point of 
interest. 


1. Point of interest 2. Link of interest 3. Main node 
4. Turning link 4a. Turning node 5. Main boundary 


Figure 3. One-way turning links. 
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Figure 5. Difference between mean estimated 
volume and mean counted volume as function of 
8 for expressways. 
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Figure 2. Definition of domain. 
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Figure 4. Domain overlap. 
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Figure 6. Difference between mean estimated 
volume and mean counted volume as function 
of 6 for arterials. 
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Table 1. Test results. 


Expressways 
—.—_ Arterials, 
11-Link 9-Link 22-Link 
Item Sample Sample* Sample 
Mean volume, vehicles per 
day, 2-way 
Counted 120, 850 123,630 16,240 
Estimated 124,200 117, 880 16,090 
Mean error 3,350 -5,750 -150 
Standard deviation of vol- 
umes, vehicles per day, 
2-way 
Counted 59,200 63,800 9,340 
Estimated 52,100 51,800 9,300 
Standard error of estimate, 
vehicles per day, 2-way 23,900 14,600 7,810 
Standard error, percent 20.0 11.8 47 
Range of volumes, vehicles 
per day, 2-way 8,600 to 216,600 8,600 to 216,600 200 to 38,300 
Range of errors, vehicles 
per day, 2-way -17,300 to 59,400 -17,300 to 17,300 -14,000 to 13,100 
Correlation of error with 
counted volumes -0,42 


“The 9-link sample is identical to the 11-link sample after 2 links are dropped that show large domain overlap. 


Choice of Optimal 8—Figures 5 and 6 show that the mean error is quite sensitive to 
the value of 8. This sensitivity is greater in expressways than in arterials. Because 
of this sensitivity, the choice of 8 is dictated by the results and is straightforward. 
Directly from the graphs we get 8(1) = 0.06 (f"") for expressways, and 8(2) = 0.08 (£ ') 
for arterials. 

Domain Overlap—The original sample includes 11 expressway links. The errors in 
estimates of these links are given in Table 1. Two of these links were respectively the 
express and the local lanes of the same section of an expressway (Dan Ryan Expressway 
on the south side of Chicago). Analysis of the domain boundaries of the links revealed 
a strong domain overlap (as described earlier). This caused overestimation of the 
volumes on these 2 links. 

Because this situation (of parallel local and express lanes) is unique and identifiable, 
it was decided to drop the 2 links from further analysis. It is expected that using the 
means suggested earlier, i.e., estimation of the joint volume on the 2 links, will enable 
solution of this problem. 

Error of Estimate, Expressways—A summary of the error properties is given in 
Table 1. The following points are of interest: 


1. A relatively wide range of volumes was checked. 

2. The standard error of estimate is 14,600 vehicles/day, 2-way or 11.8 percent of 
the mean volume. The maximum error in the sample is 17,300 vehicles/day. 

3. The variance of the estimated volumes is lower than the variance of the counted 
volumes. This may be caused by a tendency of the model to underestimate high-volume 
links and overestimate low-volume links. 


The error of 17,300 vehicles/day (the maximum observed) is less than the capacity 
of 1 expressway lane. Such a level of accuracy is well within the range acceptable for 
planning. Although the accuracy of estimates depends also on the errors in other stages 
of the modeling (network speeds and zonal trip end estimates), it seems that this total 
approach is relatively reliable for estimating expressway volumes. 

Error of Estimate, Arterials—A summary of errors in estimates of arterial volumes 
is given in Table 1. The following observations are of interest: 


1. The range of volume tested is quite high—200 through 38,000 vehicles/day. The 
results show that further breaking of this group may be helpful. 

2. The standard error of estimate is 47 percent of the mean volume or 7,810 vehicles/ 
day. 
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3. The range of errors is from -14,000 to 13,100 vehicles/day. 

4. A significant negative correlation (-0.42) was found between the error and the 
counted volumes. This shows, as expected before, that the model underestimates high- 
volume links and vice versa. 


Although this level of accuracy may be sufficient for certain purposes in transporta- 
tion analysis, it is by no means satisfactory. More work on the model is required in 
order to improve these estimates. 

Network Speeds—Multiplication of link friction values by a constant is equivalent to 
multiplying by the same constant. Seemingly, a systematic increase or decrease in 
link friction values is equivalent to these multiplications. Such systematic changes in 
friction occur while a transfer is made among theoretical, free, loaded, and congested 
networks. (These are not accurate terms but refer to the travel friction under different 
link loadings.) Because of the high sensitivity of the volumes to 8 (as shown in Fig 5), 
it is important that, in both the network used in the calibration stage and the network 
used for prediction, the friction corresponds to the same level of load on the network. 
The lack of capacity constraints in DTEM will always cause certain inaccuracies in 
volume estimates. However, systematic bias, which may be caused by using 2 differently 
loaded networks, is at least as erroneous. 

Correlation Analysis of Errors—The errors in the arterial volume estimates were 
analyzed in order to find the reasons for the high errors and, possibly, to find ways of 
correcting them. The following results are of interest: 


1. Negative correlations exist between the error and the counted volumes, capacity 
class, or number of lanes. This fact may be explained, at least partly, by lack of ca- 
pacity constraints. It is reasonable to assume that, given the increase in link friction 
due to increase in volume, the prime domains of the small links will decrease and cause 
the total volume on these links to go down faster than on other links. 

2. The error is negatively correlated with distance to the nearest parallel express- 
way. 


Correction for these 2 variables through the regression equation causes a decrease 
in the standard error of estimates of approximately one-half. Nevertheless, it is not 
recommended that these corrections be used without further study of the causes of 
errors in the model. 


Conclusions 


1. DTEM produces reliable estimates of expressway volumes. These estimates 
may be used for planning and analysis. 

2. Currently, it is not recommended that DTEM be used for estimating arterial vol- 
umes. In case such a use is made, the range of possible errors should be kept in mind. 
3. Application of the model gives an excellent method for describing the probable 

sources of traffic on a given link and the function of the link in serving the travel de- 
mand as a part of the network. Visual representation of the results using the available 
auxiliary programs is of special help. 


Recommended Further Research 


Some theoretical problems and directions for research are suggested elsewhere (3 
and the Appendix). In this section, concentration is on operational research in the 
framework of the existing theory. 


1. It is suggested that the decay function be carefully examined by using data from 
roadside interviews when and where such data are available. 

2. An intensive examination of error properties of arterial estimates should be 
made. Large reductions in the existing error levels are very likely. 

3. A procedure that enables merging of links with overlapping prime domains into 1 
equivalent link should be developed and tested. Such a procedure may be included 
within the program by a separate auxiliary subroutine. 
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4, This model is the first one used by CATS in which absolute path values and not 
relative values only are used for predicting link volumes. A comparative study of 
scaling of the 3 main networks (1965, current, and 1985) should be carried out in order 
to ensure compatibility among the networks. 

5. Use of the available graphic output for production of predicted flow diagrams on 
major links should be studied. 


It is expected that a large part of the existing problems in the model may be solved 
by conducting the research recommended here. Besides the solution of the specific 
problems, this research may serve as an excellent tool for improving understanding 
of urban travel characteristics. 
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APPENDIX 
MODIFICATION OF THE VOLUME EQUATION 
Schneider developed the volume formula as follows (1): 


1. A hypothetical trip end is put near the point of interest. It is shown that 
Q = 1/(P, + P,) (9) 


where P, and P, are respectively the probabilities that any northbound or southbound 
vehicle at the POI will choose the hypothetical trip end as its destination. Q is the daily 
1-way volume through the POI. 

2. "Free" vehicles are defined as vehicles that begin their trips inside the prime 
domain. The remaining vehicles are called "fixed." The proportions of the free ve- 
hicles (called A, and A,) in the southbound and northbound traffic streams are found. 
This is done by equating the number of trips originating in the south prime domain with 
the number terminating there (1, Eqs. 12, 13, and 14). 

3. The probabilities P, and P, are later calculated as the weighted average of the 
probabilities that either a fixed or a free vehicle will occupy the hypothetical destination 
(1, Eqs. 9, 10, and 11). 

4. P, and P, are substituted in Eq. 1 to get the solution (1, Eq. 16). 


In the original representation of the direct traffic estimation method, Schneider (1) 
found the 1-way daily volume on a link to be 


Q: = [(L, - 1,)/@, +1,)1(01 - (@-34,) + (1 - 7,)] (10) 


where 


I, = the domain integral of d, 
n,n’, s,s’ = 4 domains, 

r, =1,/I,, and 

¥, =i AL. 
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Or, substituting the r's gives 
Qi = (1/G, +h) +h1,-171 (11) 


Brown and Woehrle (2), in applying the method, give the formula for calculation of 2-way 
link volume as 


Qe = [1/(L, + 1) 1-(1v1, + 1+ Lh) (12) 


They refer to Schneider (1) as the source of their formula but give no explanation for 
the differences. Schneider himself suggested (1, p. 115) that modifications to his Eq. 
16 were required and were being implemented but did not specify why and how they 
were to be done. 

The analysis here results in a third volume formula as shown below. 


Choice of Destination and Route by Northbound Free Trip 


A typical situation is presented and is shown in Figure 2. 

A domain dz is located in the south prime domain. A point p is found so that every 
trip from dz to the north has to pass through it. Here, the choice of the exact destina- 
tion and the route of such a trip are discussed. Because all the vehicles going to the 
north from dz pass through point p, it will be used as a point in which the trips are ob- 
served, ensuring that no trips are eliminated from consideration. Equation 13 gives 


P {Ded} =1*/I* (13) 


where D is the destination of the trip, d is some domain in the north, and I* is the 
domain integral with respect to p. 

Clearly, at p, the trip is completely free to choose a route. If dis in the north prime 
domain, it is very likely that the trip will use the path through the POI. When d is not 
in the north prime (for example, d =d, in Fig. 1), other paths may be considered. In 
particular, the possibility of crossing the main boundary at q (Fig. 2) cannot be ignored. 
q is the point on the main boundary nearest tod. This possibility should be accounted 
for because it is considered a valid choice while the symmetric case is analyzed (q is 
the POI, trip originates at di). Without unnecessary complications, it may be assumed 
that the probabilities of crossing at the POI and at q are each equal to "h. 

Putting this in probability notation gives the following: 


1. For any trip originating in s’ and going to n, 


P (KX = POIID¢n’} = % (14) 
P (KX =POI| Den’}=1 (15) 
P Ded} =1*/Tt (16) 


where X is the crossing point of the main boundary. Taking d = n’, gives 
P{Den’} =I /It (17) 


P{X =P¢@I} = 4° (1 - 1x /Tk) + 14/14 
(18) 
= (I#+ Is. )/(2 + Te) 
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2. For those trips that also pass through the point of interest, 


P{Den’ } = (+ /l*) - P{X = POT} 


(19) 
= (2-Ix% )/(T- + 14) 
In case the decay function being used is the negative exponential, it always gives 
Th /l¢ = 1, /ly 
and Eq. 19 can be modified into 
P{Den }=(2- I)/(, +1) (20) 
without any loss of generality. For convenience of notation, define 
Yr, = (2 : I,’)/(, + 1) 
(21) 
r, = (21; )/(, + Is) 
Volume Formula 
Using the steps described in the preceding section gives 
QA: = Q,(1 = A,) + Q.r A, 
(22) 
Q.A, = Q,(1 - A,) + Q,7,A, 
(parallel to 1, Eqs. 12 and 13). 
The solution of this system (clearly, Q, = Q;) gives 
Bis = r,/(1 -(1 7 t,) “i = Ey 
(23) 
A; = r,/L1 -(1 = Ys) “(1 = r.) 4 
(parallel to 1, Eq. 14.) 
P, = 1, fA, 2/0, +1.) + 1 - A.) (1/093 pay 
P, =I.fA,+ (2/@, +12)]+(1- A.) -(1/13)} 
(parallel to 1, Eqs. 10 and 11). 
Putting Eq. 23 into Eq. 24 gives 
P, = (21,/(I, + sd OF +Y;, -T,° t.] 
(25) 


P, = BL/G + T)Virs + 2s - Tae Fel 
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To conclude, take I, = 1 and put Eq. 25 into Eq. 12, which gives 
Qs = [L,- 1, +1]°LI/,+h+n+)) (26) 


where Q; is the 1-way volume through the POI. 


Some Observations 


A major shortcoming of this approach is the necessity of analyzing the problem at 
the origin instead of at the POI. But this procedure seems necessary to make the theory 
of DTEM completely valid. Because of this procedure, it is necessary to use the nega- 
tive exponential function as the only possible decay function (for the crucial passage 
from Eq. 19 to Eq. 20). Only the negative exponential function has a "complete lack of 
memory" required to assume that the residual trip length distribution is always equal 
to the total trip length distribution. 


TOWARD AN EFFICIENT HIGHWAY SERVICE 
ESTIMATOR FOR CONGESTED NETWORKS 


Hubert Le Menestrel and Bruno R. Wildermuth, Wilbur Smith and Associates, 
Columbia, South Carolina 


This paper presents a new approach to obtain estimates of highway ser- 
vices on congested networks. Specifically, a 1-pass incremental capacity- 
restraint procedure is detailed that generates logical paths for reuse with 
the average network to obtain travel-time and travel-cost estimates. The 
method is based on the multiple-routing principle for path generation and 
Schneider's 1-pass capacity-restraint technique. The operation of the pro- 
cedure is illustrated with an example, and results of several analyses are 
also presented. Suggestions are made for further research to establish 
reliable procedures for the estimation of average speeds and to investigate 
further the validity of the capacity-restraint paths in relationto the loading 
sequence used by the procedure. 


°THE RAPIDLY growing expenditures for urban facilities at all levels of government 
have been paralleled by increased competition for public funds and by public awareness 
of the need to set community goals and objectives within which the full range of possible 
alternatives must be evaluated. Consequently, more demanding evaluation and decision 
criteria need to be established to facilitate complete consideration of the impacts of 
alternative transportation systems on the total urban system. 

The transportation planning process provides the basic structure with which the 
characteristics of alternative networks can be measured and their respective abilities 
to satisfy objectives and goals evaluated. It provides the travel demand forecasts that 
serve to evaluate the potential cost and benefit patterns. However, this process is 
often limited and cumbersome in its capabilities for estimating service aspects of 
alternative transportation systems. 

Within the past 15 years, urban transportation planning has relied increasingly on 
mathematical models and electronic computers as its basic tools. During this period, 
both the models and the computer systems have steadily been improved, although ad- 
vancements in the design of computers have far outstripped those of analytical models. 

Primary emphasis in the development of models and associated computer programs 
has been on analysis and estimation of travel demands and capacity evaluations. In- 
creasingly, added emphasis is also placed on the modal-choice decision process and 
systems evaluation procedures. 

A recent report (1) identified the following means that should be provided by trans- 
portation systems evaluation procedures: 


1. Measuring and valuing the effects of transportation system changes on travelers 
and on community residents; 
2. Estimating the system costs of transportation changes; and 


3. Relating transport changes to high-level goals and to overall framework for 
decision-making. 


To operate efficiently, service estimators and other systems evaluation models 
must be designed to accept travel demand projections and network descriptions from 
other transportation analysis models. Furthermore, their outputs must be structured 
for complete compatibility with other models to facilitate the explicit incorporation of 
service variables into trip generation, distribution, modal-split, and assignment 
models. Only then will it be possible to supply these models with measures of trans- 
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portation services (e.g., travel times and travel costs), which are consistent with 
the anticipated levels of network congestion. 

To date, major emphasis in evaluating alternative transportation systems has been 
on travel benefits. Estimates of transportation systems services, therefore, have 
been limited largely to measures of place-to-place travel times. Such estimates are 
also used most often to provide inputs and feedbacks to many travel demands and 
modal-choice projection models. 

Not infrequently, however, analysts have ignored the feedback effects of congestion 
on the generation of demand and the projection of modal-choice decisions. The main 
reason for ignoring these feedback effects undoubtedly are the cumbersome and costly 
efforts required to obtain service estimates from existing analysis tools. Existing 
packages for urban transportation planning provide a number of programs that, when 
properly sequenced, can produce estimates of zone-to-zone travel times and travel 
costs for congested networks. 

For highways, the programs required are capacity-restraint, link-costing, and 
skim-impedance-path programs. Capacity restraint in itself requires a series of 3 
programs to be run in repeated succession. However, application of the capacity- 
restraint series of programs will only produce the data that, when averaged, will re- 
flect the network conditions under the given level of congestion. This network can 
then serve to determine minimum-impedance paths from which zone-to-zone travel 
times and travel costs can be estimated. 

The remainder of this paper describes a fast and efficient method that produces 
reliable estimates of zone-to-zone travel times and travel costs for congested highway 
networks. This procedure, which is easily operated at a fraction of the cost of other 
methods, has been tested on a number of highway networks for different urban areas. 


ME THODOLOGY 


To obtain estimates of highway services for congested networks requires an efficient 
capacity-restraint method. The method presented here produces, with 1 application 
and without iterations, an average network description (reflecting the level of conges- 
tion) and a set of valid "best path" trees along which zone-to-zone travel times and 
travel costs can be accumulated from the average network. The technique is based 
essentially on 2 concepts: a recently developed 'multiple-routing" best path procedure 
that utilizes a stochastic approach and a modified version of Schneider's incremental 
1-pass capacity-restraint model. 


Multiple-Routing Principle 


A procedure has been devised that eliminates the most critical problem of the min- 
imum-path procedure and routes directional paths between pairs of nodes over all 
routes whose times differ from that of the shortest route by less than a given propor- 
tion. This procedure, called multiple routing, has made it possible to ensure proper 
balance of flows between alternate routes of equal merit. Theoretically, the multiple- 
routing principle allows the directional path between any pair of nodes to be different 
for each path generated. In practice, of course, the number of different paths is 
limited by the acceptable alternatives available within the network configuration. Mul- 
tiple routing assumes that drivers associate a "perceived" travel time with each link 
of the network and that not all drivers will necessarily perceive the same time for a 
link. 

Stochastic Approach—The multiple-routing principle is based on a stochastic ap- 
proach that adds a long-sought element of realism to traffic assignments. Use of the 
stochastic process for the multiple-routing principle is limited to the generation of 
perceived link times that are drawn at random from a normal distribution of times 
for each link. Thus, maximum use can be made of existing concepts by relying on the 
fact that in the average network each pair of nodes will be traversed many times be- 
cause of the large number of centroids normally used. 

This approach requires very little additional computer time because only 1 tree is 
generated for each centroid. The only extra time is that required to calculate each 


91 


time a link is being considered, a random sample from a normal distribution with a 
mean equal to each link's given impedance value. 

Because most routes consist of a string of links, it is not possible for the stochastic 
process to generate unreasonable routings. The sum of perceived link times will dif- 
fer on the average from the sum of mean link times by a smaller percentage for long 
routes than for short routes with few links. This corresponds well with the real world. 
For example, a driver might choose a route that takes 7 min, when the best available 
route takes only 5 min, and thus accept a 40 percent increase in time. On the other 
hand, it seems much less likely that anyone would choose a route that takes 70 min 
when the best route available takes only 50 min. Clearly, the multiple-routing prin- 
ciple reflects likely actual route choices by drivers. 

Effect of Alternate Routes on Link Flow—In the application of the multiple-routing 
principle as presented here, all trips from 1 centroid to another centroid will use the 
same path; in effect, this is the all-or-nothing principle. However, not all trips pass- 
ing from 1 node to another will use the same path, provided, of course, that the net- 
work consists of a reasonable number of centroids. The incidence of multiple-source 
centroids, therefore, serves to distribute the traffic flow onto all acceptable alterna- 
tive routes. In the method described here, the route chosen for a particular trip may 
to some degree be-arbitrary, in the sense that it is determined by a process of chance. 
The total traffic flows, however, on each link are clearly insensitive to the stochastic 
process. 


One-Pass Incremental Capacity-Restraint Procedure 


Previous research and experience with capacity-restraint procedures have indicated 
that to achieve stable speed-flow relations in congested networks requires either an 
incremental procedure or an average of the results of several iterations with the all- 
or-nothing method. Stability of speed-flow is attained when the use of one or more 
iterations or increments would not significantly alter the flow or the speed of any link 
in the network. The application of the multiple-routing method of building best paths 
will reduce the amount of imbalance generated for each assignment step. Hence, a 
reduction in the number of assignment increments or the number of iterations can be 
justified. 

Alternatively, in combination with multiple routing, it is possible to devise a more 
efficient way of selecting increments of the total traffic demand for assignment at each 
step. The 1-pass incremental procedure based on Schneider's basic approach presents 
such an alternative. 

Schneider's Assignment Concept—Schneider's basic philosophy was founded on the 
assumption that each unit of traffic seeks to travel along the best available path. Fur- 
thermore, the addition of each unit of flow to a link would, in effect, reduce the speed 
of travel on that link and, therefore, create a different best path situation for each 
additional unit of traffic demand desiring to move through the network. 

The method, originally developed for the Chicago Area Transportation Study (2) 
combined trip distribution and traffic assignment into a single comprehensive process. 
Only the assignment concept, which substantially reduces the sometimes troublesome 
effects caused by changes in the loading sequence, is used here. It is a 1-pass incre- 
mental process whereby the fraction of the total traffic demand loaded during each in- 
crement is the total demand from 1 origin centroid. 

Trips from the first origin centroid in the randomly selected loading sequence are 
loaded on the minimum-path tree determined on the basis of the input travel times. 
Link times are then updated according to an exponential time versus volume-capacity 
function. The travel demand from the next centroid in the random sequence is then 
loaded along the minimum path determined from the revised network times. This 
process is repeated until the entire traffic demand matrix has been assigned. A ran- 
domly selected loading sequence is used to minimize the effects of restraint on routings 
and travel times for traffic of the same geographic origin. 

An Approach Using Multiple Routing—Schneider's incremental method was efficient 
in its use of only 1 tree per origin centroid. However, it required an excessively large 
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number of network modifications (1 per origin centroid). Apart from conceptual rea- 
sons, the frequent modifications served to reduce the problems associated with the use 
of minimum-path, all-or-nothing procedures. Multiple routing has inherent balancing 
qualities that allow, in combination with the incremental 1-pass technique, a substan- 
tial reduction in the frequency of network modifications without loss of stability. 

The increment of traffic demand that can be allocated between successive network 
updates, without excessively increasing the flow on any 1 link, can readily encompass 
traffic demands from several centroids as long as drastic changes in travel times are 
being avoided. The probability of any link being used by 2 or more successive trees is 
very small if a random-loading sequence is used because each origin centroid is nor- 
mally separated spatially from the preceding and the following centroid. In addition, 
the characteristics of multiple routing further reduce the probability that successive 
trees will use the same path between pairs of nodes. 

As the flow on many links reaches the practical capacity level, it is more important 
to load smaller fractions at each increment. This is readily controlled by selecting 
fewer origin centroids from the predetermined sequence. The total number of net- 
work modifications, which account for the increase in computer time above that of the 
all-or-nothing technique, is reduced by a factor of 10 or more compared to Schneider's 
original concept. 

Speed-Flow Relation—Schneider used an exponential function to modify travel times. 
This assumption does not conform to actual speed-flow relations observed on highways 
but yields increasingly lower speeds with increasing volume-capacity ratios. A "'real- 
life'' speed-flow function, on the other hand, would approach a vertical tangent at 
maximum attainable capacity. The restraint function, however, must not be allowed 
to preclude altogether the assignment of flow beyond the maximum capacity level. 

This is because one of the functions of traffic assignments is to identify potential ca- 
pacity deficiencies within travel corridors of transportation networks. 

The function used must first of all satisfy the principle of the method rather than 
direct itself to a precise simulation of the real world. Links that are loaded with a 
volume reaching capacity after only a small percentage of the total demand has been 
processed require more drastic modifications than links that reach the same volume- 
capacity ratio at the end of the loading process. In short, the model must anticipate 
the effects of the additional demand still to be loaded and react in a way that will pre- 
vent (to a certain extent) its volumes from exceeding its capacity by 2 or 3 times. 

Stability and Convergence of Procedure—The 1-pass incremental method permits 
estimates of speeds and flows in congested networks with a minimun of computer time. 
In addition, it avoids the nonconvergent unbalanced estimates of flows that characterize 
iterative methods, as illustrated by the following example. 

Figure 1 shows a portion of a larger network in which many trees are routed be- 
tween points A and B. Three alternative paths exist between A (a bridge head) and B 
(the main access point to a major center). Path 2 is an old road with a directional 
capacity of 2,500 vehicles, while paths 1 and 3 are newer, wider facilities with direc- 
tional capacities of 6,000 vehicles each. Travel times at practical capacity are 36, 34, 
and 37 integer time units respectively (most network analysis computer programs 
work with integer values). Volumes entering from 5 assumed source-centroids are 
as shown. 

Table 1 gives the progression of calculated travel times and volumes loaded on 
each path for 4 iterations of the FHWA capacity-restraint procedure. The first iter- 
ation (free flow) loads the total demand of 5,500 vehicles onto path 2, the old narrow 
road. This leads to drastic overloading and consequently a substantial increase in 
travel time for path 2. Times for paths 1 and 3, on the other hand, are reduced be- 
cause of the absence of flow. The second iteration loads all traffic onto path 1. This 
leads to a downward adjustment of travel times for paths 2 and 3. In iterations 3 and 
4, traffic is loaded alternatively on path 3 and 1, indicating the beginning of a noncon- 
vergent shifting process. The example clearly establishes the need to average the 
flows of all iterations to achieve reasonably stable flows. In fact, for the particular 
example, 4 iterations are not sufficient to attain stability. 


Figure 1. Network configuration with 3 alternate Table 1. Path times and flows for 4 iterations with averaging 


paths between A and B. method. 
Path 1 Path 2 Path 3 
Iteration Time Flow Time Flow Time Flow 
1 36 Q 34 5,500 37 0 
2 35 5,500 59 Qo 36 0 
3 35 0 57 0 35° 5,500 
4 34 5,500 55 G0 3§ 0 
Avge 2,750 1,375 1,375 
Final V/C ratio 0.46 0.55 0.23 


results would not be affected. 


Table 2. Path times and flows for 5 centroids with 1-pass incremental method. 


Path 1 Path 2 Path 3 

Cumu- Cumu- Cumu- 

lative lative lative 
Increment Time Flow Flow Time Flow Flow Time Flow Flow 
1 31 0 0 30 1,200 1,200 32 0 0 
2 31 800 800 38 0 1,200 32 0 0 
3 32 0 800 38 0 1,200 32 1,600° 1,600 
4 32 500 1,300 38 0 1,200 36 0 1,600 
5 34 1,400 2,700 38 0 1,200 36 0 1,600 
Final V/C ratio 0.45 0.48 0.27 


“If the stochastic process would have selected path 1 instead of path 3, final flows would be 2,400 for path 1 and 1,900 for 
path 3. 


Table 3. Path times and flows for 100 centroids with 1-pass incremental method. 


Path 1 Path 2 Path 3 

Cumu- Cumu- Cumu- 

lative lative lative 
Increment Time Flow Flow Time Flow Flow Time Flow Flow 
1 31 385 385 30 605 605 32 110 110 
2 32 440 825 34 165 770 32 445 605 
3 34 495 1,320 36 55 825 33 550 1,155 
4 36 330 1,650 36 330 1,155 35 440 1,595 
5 36 495 2,145 38 220 1,375 37 385 1,980 


Final V/C ratio 0.36 0.55 0.33 


*It is assumed that, in the case of equal times, path 3 rather than path 1 will be chosen. If 
path 1 were chosen first, path 3 would be selected in iteration 4. In either case, overall 
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Table 2 gives the progression of volumes and travel times with the 1-pass incre- 
mental method. For the example, with only 5 centroids, each increment's traffic 
demand is assumed to be that of 1 centroid. The initial times for all paths are orig- 
inally set equal to 0.87 times the value given at a flow level equal to practical capacity. 
(This value corresponds to the ratio of the travel time under free-flow conditions 
versus the travel time at a flow equal to practical capacity as defined by the Highway 
Capacity Manual.) Centroids are assumed to be selected for loading from left to 
right. A flow of 1,200 vehicles is assigned first to path 2 and results in a substantial 
increase in time on that path. The demand for increment 2 is loaded onto path 1 and 
increases that path's travel time. The third increment will choose either path 3 or 
path 1 depending on the random samples generated. This results in a condition where 
the other path will receive the flows from the remaining 2 increments as illustrated. 

This example demonstrates the economical operation of the proposed method. With 
only 5 trees and 4 network modifications, a speed-flow condition was achieved that 
appears more stable than that obtained with 20 trees (4 iterations with 5 trees each) and 
3 network modifications. Because network modifications require only marginal 
amounts of computer time compared to building sets of trees, it is obvious that the 
1-pass method achieves substantial savings. The added cost of building multiple- 
routing best path trees (approximately 15 percent more than minimum path trees) 
does not substantially affect these cost savings. 

The smoothness with which the 1-pass method increases the volumes on each path 
from 0 to their final flow is barely indicated by this example. In a real network with 
several hundred origin centroids, the smoothness of increase in flows would be much 
more pronounced because the volume loaded along each tree would represent a smaller 
fraction of the final flow. If there were 100 centroids instead of 5 and an average vol- 
ume of 55 trips per centroid, the loading process for the sample network would pro- 
ceed as given in Table 3. With 20 trees being generated and loaded during each in- 
crement, the benefits of multiple routing are now clearly apparent. Modifications of 
travel times are more gradual and each path receives some flow in every increment. 

Calculation of Average Link Speed—The average speed on any link in a congested 
network is determined from its flow and the total vehicle time traveled on the link. 
Total vehicle time can be expressed by the sum of the products of the number of ve- 
hicles assigned during each increment (between 2 network modifications) and the 
travel times at which those vehicles have been assigned. ‘This is not easy to calculate 
because core storage limitations preclude keeping a separate accumulator for each 
link. 

Most capacity-restraint assignment procedures solve this problem by calculating 
the sum of the products of the total assigned volumes by an average travel time calcu- 
lated from the input time and the time used during the last increment or the last 
iteration. Analysis of this problem suggests that the area under the time-versus- 
volume curve would yield an acceptable approximation of vehicle time. Use of the 
area under the curve will result in a slight overestimation of the correct value caused 
by the assumption of a smooth curve instead of the resulting step function (Fig. 2); 
however, with proper adjustment, the calculated value presents a reliable estimate. 

Travel-time calculations for paths 1, 2, and 3 of the 5-centroid example previously 
illustrated would result in average values of 34, 33, and 33 time units. For the 100- 
centroid example (Table 3), average values would correspond to 33, 34, and 34 time 
units respectively. The corresponding estimates for the iterative method, derived 
from the average volumes and the fourth-power relation, are 32, 30, and 32. If the 
estimates are made on the basis of speeds used for each iteration weighted by the 
volume loaded, the values would be 35, 34, and 35 time units. The resulting values for 
the 1-pass method clearly fall within the range of estimates from the iterative method, 
indicating a need for further research on this topic. 

Fraction of Demand Assigned by Each Increment—The accuracy of assignment re- 
sults is related to the amount of traffic assigned to each path and the fraction of the 
total traffic demand assigned between network modifications. This fraction in turn is 
controlled by the volume originating at each centroid and the number of centroids. The 
procedure, therefore, allows the user to choose the number of trees built between 
modifications. 


95 


Figure 2 shows the accumulation of volume for a typical link located centrally within 
a small network. The volume buildup is shown for assignments loading 5 and 20 trees 
respectively between network modifications. Results obtained in each case are quite 
similar, although a more reliable average speed estimate would have been expected 
from the loading based on 5-tree increments. 

The program has been prepared with an additional routine to decrease the fraction 
of traffic demand loaded at each increment whenever the accumulation of flow exceeded 
a set level between successive network modifications. This procedure evaluates, after 
each modification, the average factor by which link times have been modified. When- 
ever the difference between this factor and the one obtained in the previous increment 
reaches the set level, the number of trees is decreased by a percentage calculated 
from a parabolic function of the difference. 


Validity of Individual Trees 


The 1-pass incremental procedure, in addition to its efficient operation, generates 
trees that follow realistic and logical routes and can be used for subsequent purposes 
such as analysis of selected links and accumulation of service parameters along their 
paths. As a result of the randomly selected loading sequence, most links start re- 
ceiving flows during early increments. In a typical run with 19 increments, more 
than 45 percent of all links showed a flow with only 8.2 percent of the total demand 
loaded. With 20.2 percent of the volume processed, almost 67 percent of all links had 
received flows; and with 39.1 percent of the total demand loaded, more than 80 percent 
of all links showed flows. 

The multiple-routing procedure, as illustrated in the 100-centroid example (Table 
3), will load flows on competing routes at an even pace and thereby ensure the smooth- 
ness of the 1-pass capacity-restraint method and the validity of its individual trees. 

Comparison of Highway Service Estimates—Service estimates accumulated along the 
paths generated by the capacity-restraint procedure were verified by comparing zone- 
to-zone travel times and travel distances with estimates obtained along the minimum- 
time paths of the average network. Values accumulated along the capacity-restraint 
paths were expressed as a percentage of those obtained from minimum-time paths and 
grouped according to 6 value ranges. 

The data from a 165-centroid network with a high degree of congestion showed 
travel-time values that were 1 percent higher on the average for capacity-restraint 
paths. More than 75 percent of all capacity-restraint paths had travel-time values that 
were within 2 percent of minimum-path times. Only 12 paths contained differences 
greater than 5 percent with extreme values of 94 percent and 114 percent respectively. 

Distance values accumulated along capacity-restraint paths were found to be 3 per- 
cent higher than those along the minimum-time path. However, this does not neces- 
sarily cause concern because assignments based on minimum-time paths have been 
found to underestimate vehicle-miles of travel. Table 4 gives the frequency of capacity- 
restraint paths by percentage difference of travel-time and travel-distance values. 

Of more interest perhaps than the variation in average values is the occurrence of 
differences within the loading sequence of the capacity-restraint procedure. Figure 3 
shows a plot of the average percentage difference of time and distance for each group 
of 10 successive paths. The plot of travel-time differences seems to indicate a slight 
but definite trend toward increasing values for the paths generated in the second half 
of the loading sequence. The same trend, but much more pronounced, is found for the 
distance values. 

Detailed analysis shows that group 16, for which the accumulated distances over 
the capacity-restraint paths are 10 percent higher than those of the minimum-time 
path, contains 2 of the 3 most extreme paths. Another extreme path is located in group 
15 and, like the other two, originates from a zone bounding the external cordon. Fur- 
ther investigation of the few extreme paths has not yet been undertaken, but a prelim- 
inary analysis indicates that peculiar network conditions may be the major cause of 
the differences. The findings suggest, however, a definite need for further detailed 
analysis, including plots of individual path traces. 


Figure 2. Progression of V/C for loading of centrally located link for different numbers 
of trees built between network modifications. 
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Table 4, Frequency of capacity-restraint 


Travel Travel Travel Travel 
paths by percentage difference in time Percent of Time Distance Percent of Time Distance 
and distance. Minimum {number (number Minimum (number (number 

Path of paths) of paths) Path of paths) of paths) 
<95 bj 2 104 8 10 
95 0 1 105 5 9 
96 0 2 106 5 9 
97 2 3 107 el 10 
98 0 1 108 2 3 
99 6 16 109 1 3 
100 42 17 >109 4 g 
es . a Total 165 165 
103 9 16 Mean 101.4 103.1 


Figure 3. Percentage difference in accumulated time and distance for capacity-restraint path by 
loading sequence groups. 
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CONCLUSIONS 


The 1-pass incremental capacity-restraint program presented in this paper pro- 
vides a flexible and economical tool for the estimation of highway service measures on 
congested networks. Computer costs experienced with this method indicate that stable 
estimates of speeds and flows can be obtained at costs that are, at most, 40 percent 
more than those required for a minimum-path all-or-nothing assignment. This com- 
pares with an increase of 350 to 400 percent when other capacity-restraint methods are 
used, not counting the cost to generate final sets of trees that properly reflect the level 
of congestion. 

A number of areas have been identified that suggest the need for additional research 
including the calculation of average speeds for congested links and the detailed analysis 
of individual tree traces in relation to the loading sequence and the level of congestion. 

The 1-pass procedure provides an efficient and realistic method to obtain service 
estimates for the evaluation of networks and modal choice under conditions of conges- 
tion. Because this procedure does not require cumbersome sequencing of many dif- 
ferent programs, it should be of interest to those concerned with analyses and evalua- 
tion of alternative transportation networks. 
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ACTIVITY-ACCESSIBILITY MODELS OF TRIP GENERATION 
T, Z. Nakkash and W. L. Grecco, Joint Highway Research Project, Purdue University 


This paper examined the effect of activity-accessibility variables on trip 
generation. A second aspect of accessibility was studied by stratifying 
the zones of the study area by location. Data from the transportation study 
in Indianapolis were used to develop 4 sets of trip-generation regression 
equations for each of 13 trip purposes. A comparison of the developed 
models revealed that location always improved the statistical strength of 
the trip-generation models and relative accessibility variables improved 
the statistical strength of trip-attraction models more than that of trip- 
production models. Stratification alone improved the models more than 
relative accessibility variables only. 1985 forecasts of demographic, 
socioeconomic, and land use variables together with estimates of travel 
time on the proposed future network were inputs to the 4 sets of developed 
models to forecast trip productions and attractions. More trips for zones 
of the noncentral area and fewer trips for zones of the central area were 
forecast by stratified models than by unstratified models. 


®IN recent years urban transportation planning activities have increased in many 
American cities. This increased activity was in response to the challenging urban 
transportation problem, one of the major problems of contemporary cities. The safe 
and efficient intracity movement of goods and people is very essential for the social, 
cultural, and economic health of an urban area. 

The urban transportation problem is the product of many interacting factors. The 
enormous population growth in urban areas and their expanding areal extent as a result 
of the redistribution of population, the improved standard of living due to increased af- 
fluency, and the subsequent greater reliance on private automobiles are only some of 
those factors. Together with those size-related features of the problem, the temporal 
aspects induce periodic high demands for transportation. This, of course, is due to 
the interdependency of human activities that occur essentially during the 8-hour work- 
day, starting and ending at rather definite times. 

A recognition of the immense complexity and the vast dimensionality of the urban 
transportation problem is a prerequisite of any attempt to solve the problem. 

Apart from the socioeconomic, demographic, and land use forecasts, trip genera- 
tion constitutes the first step toward establishing the future travel pattern. The ac- 
curacy of the future trip distribution in forecasting design-year trip interchange cannot 
be any better than the accuracy of the trip-generation forecasts. 

The ultimate purpose of the trip-generation analysis is to arrive at an estimate of 
the trip ends generated at each analysis unit of the study area. Trip-generation tech- 
niques try to establish a relation between the demographic and socioeconomic char- 
acteristics of the population of an analysis unit and its trip generation. Similarly, the 
intensity, character, and location of different land uses are related to trip-making of 
the analysis units. These procedures are based on the hypothesis of a causal relation 
between population characteristics, land use, and trip-making behavior of people. 

Traditionally, trip-generation forecasts are established independently of any direct 
consideration of the transportation network. This, of course, assumes that trips pro- 
duced at or attracted to a zone are a function only of the attributes of the zone itself 
and are not directly a function of the transportation network on which the trips were 
made. The traditional trip-generation process is shown in Figure 1. 


98 


99 


PURPOSE AND SCOPE 


The purpose of this research was to study the trip-generation process and specifi- 
cally to investigate the effect of the transportation system on the rate of trip-making. 
Conceptually, there is no strong basis for assuming that trip-making is independent of 
the transportation system. On the contrary, it seems that trips produced by or attracted 
to a zone should be a function of the relative accessibility of the zone to different land 
uses in addition to the characteristics of the zone itself. 

Trip-making is a product of the desire for human interaction and the necessity for 
having to perform different daily activities at different locales. Basically, the rate of 
trip-making is a function of 2 categories of variables: One tends to increase the po- 
tential of trip-making, and the other tends to restrict it. The availability of vehicles to 
the residents of a zone, the percentage of the residents in the labor force, the number 
employed in a zone, and the amount of floor area of different land uses are examples 
of the first category of variables. They measure the potential of trip production or 
trip attraction. The penalties incurred by travel measured in cost, travel time, or 
travel distance are variables that belong to the second category. 

This study utilized data obtained from the surveys for the Indianapolis Regional 
Transportation and Development Study (IRTADS). Multiple linear regression predictive 
models of person-trip productions and attractions by purpose were developed. The de- 
veloped models differ from the traditional trip-generation models. The independent 
variables were not restricted to socioeconomic and land use measures of the zones but 
included also measures of the relative accessibility of the zone to different activities 
and land uses. 

The locational aspects that affect trip generation were also investigated. It was hy- 
pothesized that central locations in the study area, generally, afford greater accessi- 
bility; and the convergence of the street network on the city center favors the core lo- 
cation. The zones of the study area were stratified into2 groups: central and noncentral. 
This stratification was entered as an independent dummy variable in the trip-generation 
analysis. 

A comparison was made of the forecast trip generation by the suggested approach 
and the forecast by the traditional approach. 


ACTIVITY-ACCESSIBILITY CONCEPT IN TRIP GENERATION 


As stated earlier, the rationale of trip generation has been based on the demographic, 
economic, and land use characteristics of the zone; no consideration has been given to 
the status of the transportation system. This research was based on the concept that 
trips generated by a zone of the study area are also a function of the status of the trans- 
portation subsystem that serves a zone and connects it to the other zones of the study 
area. The effect of the transportation system on trip generation was investigated in 
the light of the relative accessibility of each zone to various urban activities and of the 
spatial relation of the different zones to each other. 

There are many instances where researchers have realized the existence of a feed- 
back from the transportation system to the trip-generation phenomenon (7, p. 18; 6, pp. 
201-202; 11, p. 75; 1, p. 66; 15, p. 98; 9, p. 166; 2, p. 38; 3, pp. 73-74; 10; and 8). In 
spite of the large number of references noted above, actually little has been done to 
measure those effects. 

The hypothesis proposed by this research is that the number of trips generated by 
a zone is a function of the transportation subsystem that connects the zone under con- 
sideration with the other zones of the study area. For this purpose, the term "relative 
accessibility" will be defined conceptually and operationally. 

It was hypothesized earlier that the trips produced by or attracted to a zone are a 
function of the causal or symptomatic variables modified by the relative ease in over- 
coming space between that zone and all other zones. Zones with relatively more ac- 
cessible destinations should, in general, produce more trips; similarly, zones that 
are relatively more accessible to origins should, in general, attract more trips. The 
term "relatively'' refers to the zone under consideration as compared to all other zones 
of the study area. This implies a competitive consideration among zones in generating 


Figure 1. Traditional trip-generation process. 
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Table 1. Dependent variables for which trip-generation equations were 


developed. 


Person-Trip Production 


Person-Trip Attraction 


Purpose Number Purpose Number 
Home-based work 1 Home-based work 8 
Home-based shop 2 Home-based shop 9 
Home-based school 3 Home-based other 10 
Home-based other 4 Non-home-based work a 
Non-home-based work 5 Non-home-based nonwork 12 
Non-home-based nonwork 6 Total 13 
Total 7 


Figure 2. Generated relative accessibility 
variables. 
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trips. Zones of similar-sized activities will attract trips differently according to 
their locational and accessibility advantages. 

As a measure of the ease or difficulty of overcoming space, this study used the set 
of friction factors developed from the calibrated gravity model of trip interchange for 
the study area. This had the advantage of avoiding the use of a constant exponent of 
distance or time. Friction factors as developed by the calibration of the gravity model 
are a function of travel time and are classified by trip purpose. 

This study's definition of relative accessibility is a modification of Hansen's. In 
notational form, relative accessibility is computed as follows: 


n 
Ax. x(g) = 1 Six F43(g) 
j= 


n n n 
RA; .4(g) = Ar. «|| ). ¥ Six Fa) + 100 = Ay. x(g) | i, Ay. «(4)|* 100 
i=l j=l i=l 
where 

i = zone under consideration (i = 1, 2,..., n); 

j = any zone in the study area, including zone i (j = 1, 2,...,i,..., m); 

k = activity under consideration (k = 1, 2,..., m); 

4 = trip purpose (4=1, 2,..., p); 

S;, = size of activity k in zone j; 
F,,(g) = friction factor corresponding to the travel time from zone i to zone j for 


purpose 4; 
A,, «(2) = accessibility of zone i to activity k for purpose 4; and 
RA,, «() = relative accessibility of zone i to activity k for purpose 4. 


The value of the relative accessibility of zone i to activity k for purpose 4, as itis 
possible to infer from its formulation, could be different for a future year if any or 
all of the following study area parameters change: 


1. Interzonal travel time, consisting of interzonal driving time, terminal time, 
and intrazonal time; or 


2. Size of activity k in any or all zones of the study area. 


The value of the relative accessibility of a zone to the same activity could be dif- 
ferent for different trip purposes. The reason is that the friction factors, correspond- 
ing to a certain travel time, are usually different for different trip purposes. 

Another aspect of accessibility, directly related neither to travel time nor to size 
of activity, is considered. The relation to trip-generation characteristics is investi- 
gated by stratifying the zones of the study area into 2 sets: central and noncentral. 
The conceptual basis is that "central sites afford maximum accessibility..." (2, p. 
108). The central area is also, more or less, equally accessible to the various zones 
of the study area because of the convergence of the street system on the city center. 
This stratification introduces a qualitative factor describing the general arrangement 
of the land uses and the configuration of the street system. 


STUDY DESIGN AND ANALYTICAL PROCEDURES 


The following methodology was used in this investigation. Trip-generation models 
that take into account accessibility variables were developed from data of an operational 
transportation study. Those models were then compared with the conventional models 
developed as part of the transportation study. Both sets of models were used to fore- 
cast 1985 trip generation. The 2 sets of forecasts were compared by testing for any 
significant differences, on a zone-by-zone basis, between the 2 forecasts. 

Models that take into consideration the stratification of the zones of the study area 
into central and noncentral sets were also developed. This stratification was investi- 


gated for the models developed by the transportation study and those developed by this 
investigation. 
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The main purpose of this investigation was to compare the sets of developed models 
that used accessibility variables and stratified zones into central and noncentral sets 
with the traditional models developed by an operational transportation study. Care was 
taken to keep any factors that might disturb the comparison out of the developed models 
so that the comparisons would be most valid. Furthermore, the decision to develop 
multiple linear regression models of trip generation by using data summarized by zone 
was mainly in the interest of keeping the results of this investigation comparable with 
those of the operational transportation study. 


Data Preparation 


The data used in this investigation were obtained in the IRTADS surveys. The study 
was at a stage where most, if not all, of the analyses were completed, the forecasts 
established, and the proposed networks evaluated. In the trip-generation analysis, all 
the models were developed for total person trips (except for truck and taxi equations). 
The dependent variables were in the form of productions and attractions suitable for 
distribution by the gravity model. Nineteen trip-generation equations were developed 
by IRTADS; 2 were for truck and taxi trip ends, 4 were for control totals, and the 
others were for person-trips productions and attractions by purpose. 

Dependent Variables—This investigation was limited to 6 trip-production purposes, 
5 trip-attraction purposes, and 2 control totals (1 for all productions and 1 for all at- 
tractions). Trip-generation equations were developed for the dependent variables 
given in Table 1. It was not possible to develop an equation for home-based school 
person-trip attractions because the key independent variable, school enrollment, was 
not available. 

Socioeconomic and Land Use Variables—Twenty-nine socioeconomic and land use 
variables were originally considered by IRTADS in its trip-generation analysis; only 
15 were, however, eventually retained in the final equations. Moreover, only 10 of 
these were available for this study for both the survey year and the forecast year. The 
following socioeconomic and land use variables were available and used by this study 
as independent variables in developing multiple linear regression models of trip gen- 
eration: total employment, retail employment, service employment, retail floor area, 
educational floor area, dwelling units, labor force, population, cars, and single-family 
dwelling units. 

Accessibility Variables—Different measures of relative accessibility to be used as 
independent variables in trip-generation regression models were established. The op- 
erational definition of relative accessibility stated earlier was used. The required in- 
puts are the size of various activities in each zone and skim zone-to-zone friction factor 
trees. 

Size of Activities—The definition of activity was extended for this purpose to include 
all the 10 socioeconomic independent variables that were available from the IRTADS 
surveys. If the information had not been available from that source, the size of activi- 
ties would have been collected from various sources, e.g., employment data from the 
State Employment Securities Division, floor area information from the land use survey, 
and other data from the home interview survey or census records. 

Skim Zone-to-Zone Friction Factor Trees—A set of friction factors for each of 6 
trip purposes was available from the results of the IRTADS calibrated gravity model. 

A binary zone-to-zone tree tape was made available by IRTADS for this investiga- 
tion. This binary tape was updated (intrazonal and terminal times added) and skimmed 
to give skim zone-to-zone travel time binary trees. Six skim zone-to-zone friction 
factor trees were built, 1 for each of 6 trip purposes. 

Only the highway network was considered in developing relative accessibility vari- 
ables. Although the trip-generation models were for person trips, using the highway 
network only would not introduce any appreciable bias in the case of IRTADS mainly 
because transit passenger trips constituted only 4.1 percent of all the person trips (8). 
Moreover, the transit in IRTADS area was entirely bus service on the city streets. 

Generated Relative Accessibility Variables—With 10 activity measures (the available 
independent variables) and 6 sets of friction factors (1 for each trip purpose), 60 mea- 
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sures of relative accessibility could be generated. However, not all 60 possible com- 
binations were generated; only those that were meaningful to the trip generation analysis 
were used. For example, the relative accessibility of a zone to retail floor area could 
be meaningful in conjunction with home-based shopping person-trip productions. Also, 
the relative accessibility of a zone to single-family dwellings would be meaningful in 
conjunction with home-based shop person-trip attractions, Twenty relative accessi- 
bility variables were generated and considered in the analysis; these are shown in 
Figure 2. The same measure of relative accessibility could be meaningful in conjunc- 
tion with both the productions and attractions of some of the trip purposes. 


Delimiting the Central Area 


As indicated earlier, this investigation considered stratifying the study area into 
central and noncentral areas. It was assumed that the central and noncentral areas 
might reflect 2 different trip-generation patterns because of the shape of the study 
area, its historical quasiannular urban growth, and the configuration of the transpor- 
tation system. This differentiation of the central and noncentral areas was categorical 
instead of numerical and could best be treated through stratification. 

The rationale behind the procedure developed to delimit the central area was tied to 
the expected character and attributes of a central area. A large proportion of the land 
in a central area is expected to be in urban use. A relatively small proportion of the 
land in a central area is expected to be devoted to residential uses. Conversely, a 
relatively high proportion of the land in the central area was expected to be in uses 
that are known to seek central location. 

Land was considered to be in urban use if it did not belong to any of the following 
classifications: quarry and mining, automobile junkyard, water, agriculture, or 
vacant. The percentage of urban land in residential use was the measure of the in- 
tensity of residential activity. The percentage of land in urban use was the measure 
of urbanity. Among the different trade and service uses on which IRTADS had floor 
area information, the following were chosen as those that seek a central location: 
wholesale trade (without warehousing); general retail trade; automobile retail; apparel, 
furniture, and appliance retail; retail use not otherwise classified; finance, business, 
and professional services; contract construction services; governmental services; 
personal services; and services not otherwise classified. Educational services were 
excluded because schools do not necessarily seek central locations. The floor area 
of each use in hundreds of square feet per acre of land in urban use was calculated for 
each district. This ratio measures not only the amount but also the intensity of use. 

The measures given above were used to set the following conditions for delimiting 
the central areas: 


1. The delimitation should be performed at the district level (a district is a con- 
tiguous group of zones). 

2. The central area should probably include all of the central business districts 
and some of the qualifying surrounding districts. 

3. The districts of the central area should all be contiguous and connected. 

4. A qualifying district must be at least two of these: in the lower quartile of all 
the districts of the study area in percentage of urban land in residential use, in the 
upper quartile of all the districts of the study area in the percentage of land in urban 
use, or in the upper quartile of all districts of the study area in the ratio of hundreds 
of square feet of uses usually seeking central location to acres of urban land in each 
district. 


The study area was stratified into central and noncentral areas (Fig. 4). The districts 
of the central area comprised 105 zones out of the 395 in the study area. 


MODEL BUILDING 
Guidelines for Model Building 


Multiple linear regression models of trip generation were developed by using the 
computer program BMD-2R, stepwise regression. In addition to the desired sta- 
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tistical qualities of the developed models, other important factors were also con- 
sidered. 

Conceptual Validity—The consideration of relative accessibility was mainly to achieve 
a sounder conceptual basis for trip generation. In addition, only independent variables 
that were logically related to the specific dependent variable under consideration were 
allowed to enter when regression equations were developed for that dependent variable. 
The causal-logical relation was considered prior to the mere statistical correlation 
analysis. Association and correlation do not prove causality; causality should only be 
hypothesized on theoretical or conceptual grounds. 

Model stability is one of the desirable products of conceptual validity. Relations that 
are not conceptually valid, if established from today's data, are more apt not to hold in 
the future. Predictive equations of trip generation should hold for the future to have 
any forecasting capability. 

Another facet of conceptual validity is the sign of the regression coefficient. Because 
of collinearity in the variables, the coefficient of one of the independent variables could 
be contrary to the theoretical relation, and this condition might be statistically accept- 
able. In spite of this, it was decided to delete those variables whose coefficients had 
signs contrary to conceptual expectations. This should increase the statistical validity 
of the model as it tends to reduce the effects of collinearity. 

Simplicity—The models were kept as simple as possible by avoiding unnecessary 
transformations of and interactions among the original independent variables. Inter- 
actions beyond the product of 2 independent variables were considered difficult to in- 
terpret and thus were avoided unless the third variable of the product was the dummy 
variable defining the location of a zone in the central or noncentral areas. 

Keeping the structure of the model as simple as possible by not going to higher order 
interactions curtails the propagation of measurement errors. Another aspect of sim- 
plicity is parameter parsimony. Although it is valuable to include all relevant inde- 
pendent variables and thus reduce specification errors, it is doubtful that it would be 
advantageous to do so when, as is the case for transportation studies, the input data 
are inherently plagued by measurement errors. As an emphasis of this research, the 
number of independent variables in the model was kept to a minimum. 

Stability—So that the developed models would be stable during a time period, the 
prerequisite of allowing a variable to enter the model was a hypothesized causal rela- 
tion rather than a mere correlation. 

Stability was also sought over the range of the values of the independent variables. 
This could be quite a difficult criterion to account for during model building. A study 
of the range of the independent variables for the forecast year was undertaken, and 
possible problem zones were identified. Recommendations will be made to ameliorate 
this condition. 

Sensitivity—It is desirable that the response of the dependent variable be sensitive 
to changes in each of the independent variables in the model. The cost of adding 1 more 
independent variable would not be justifiable if the dependent variable is not sensitive 
to changes in the added independent variable. The sensitivity of the dependent variable 
to each of the independent variables in the model was tested by calculating the standard- 
ized regression coefficients (the regression coefficients multiplied by the ratio of the 
standard deviation of the independent variable under consideration to the standard de- 
viation of the dependent variable). 


Statistical Considerations 


Stepwise Regression—The computer program used by this research, as mentioned 
earlier, was BMD-2R, stepwise regression. Several procedures are available to de- 
velop multiple regression models. The "tear-down" or "backward elimination" method 
starts with a model containing all the available independent variables and subsequently 
eliminates some of the independent variables until a mod’ with predescribed statistical 
features is reached. The "build-up" or "forward selection" procedure strives for a 
similar final outcome but works in the opposite direction by inserting 1 more inde- 
pendent variable at atime. Stepwise regression is an improved version of forward 
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selection procedure. The independent variables in the model are reexamined at the end 
of each step. The variable that might have been the best single variable to enter at an 
earlier step might prove to be unnecessary at a later stage because of the relation be- 
tween it and other variables now in the equation. Thus, at each step, the partial F-test 
for each variable in the equation was evaluated and compared to a preselected percent- 
age point of the appropriate F-distribution. Stepwise regression evaluates the contri- 
bution of each independent variable in the model at the end of each step, regardless of 
whether the independent variable has entered at the last step or at any earlier step. 

Partial F- or Sequential F-Test—By far, the most important statistic in conjunction 
with regression analysis is the multiple coefficient of determination, R*. It measures 
the proportion of total variability in the dependent variable explained by the regression 
model. R? varies between 0 and 1: A value of 0 indicates a complete lack of fit, while 
a value of 1 implies a perfect correlation. In stepwise regression, a test is needed at 
each step to check whether the increase in R’ contributed by each added independent 
variable in the equation is significantly different from 0, The following F-statistic 
tests whether the contribution of the k independent variables is significantly greater 
than 0. 


Fy n-k-1 = (Ri/k/ [1 - R2/n -k - 1)) 


where 
n = number of observations, 
k = number of independent variables, and 
R? = coefficient of multiple determination of a model with k independent variables. 


The calculated F-statistic is compared to a tabulated Fy, ny, 1-@, where @ is the 
probability of a type I error, or the level of significance. The level of significance 
chosen should depend on the consequences of rejecting a true hypothesis. The level 
of significance was set at 0.010 for including a variable and at 0.005 for deleting a 
variable. The selection of these values is based on acceptance of a relatively high 
risk of including a variable that does not belong. Once this variable has been accepted, 
there is a lower risk acceptable for its retention in the equation based on the entry of 
other independent variables. 

The blind use of the F-test may result in the development of a regression model 
that involves more independent variables than are of practical significance. In trans- 
portation studies, the number of observations is large and results in an F-statistic 
that is statistically significant even when the absolute increase in R’ is very small. 
The criterion of a significant increase in R’ proved to be superfluous in most cases; 
other criteria such as simplicity, parsimony, and reasonableness controlled the num- 
ber of variables to be included in the model. 

Standard Error of Estimate—Another statistic of interest is the standard error of 
the estimate, s. It is the square root of the residual mean square. The smaller the 
value of this statistic is, the more precise the predictions will be. The criterion of 
reducing s must be used cautiously because s can be made small by including enough 
parameters in the model, just as R® can be increased. As more independent variables 
are included in the equation, the decrease in s will be at a decreasing rate. Reduction 
of s is desirable if many degrees of freedom for error are remaining. 

Another way of looking at the reduction in s is to consider it in relation to the de- 
pendent variable, namely, as a percentage of the mean value of the dependent variable. 
Standard error of estimate as a percentage of the mean of the dependent variable is re- 
ferred to as the coefficient of variation. 

t-Test on Regression Coefficients—It is sometimes desirable to test whether each of 
the estimated regression parameters is significantly different from 0. The ratio of 
each regression coefficient to its standard error is distributed as student-t. If the re- 
gression coefficient of one of the independent variables does not pass the t-test, it can 
be deleted from the equation. 

The 3 criteria of R*, s, and significance of the regression coefficient are not inde- 
pendent. Usually, the decision can be made on the basis of R’ alone. 
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Model Identification 


The first set of models that was developed by this investigation was a rerun for each 
of the 13 dependent variables using the same independent variables established by 
IRTADS for its equations (3). In the interest of compatibility and comparability, data 
from all the 395 zones were used to reestimate the parameters of the models developed 
by IRTADS. Those models, essentially developed by IRTADS, were used as a basis 
for comparison with other developed models. 

A second set of models that include relative accessibility variables was attempted 
for each of the 13 dependent variables. The first 2 sets of models were developed with 
data from the 395 zones with no distinction relative to location in the central or non- 
central areas. Two more sets of models were developed: one corresponding to the 
set developed by IRTADS and the other to the set of models developed by this investi- 
gation. These latter models contained a dummy variable defining the location of a 
zone in the central or noncentral areas. 

Basically, 4 sets of models were developed. Two had no relative accessibility vari- 
ables among their independent variables: one of those, set W-U, was developed by the 
traditional procedures for IRTADS and the other, set W-S, contained a dummy variable 
that defined the zone location or some of the interaction of the dummy variable with the 
other independent variables in the equation or both of these. Of the remaining 2 sets, 
set A-U had relative accessibility variables, and set A-S also had relative accessibility 
variables and was calibrated with stratified data. Figure 3 shows a system for the 
identification of the developed models. It was not possible to develop models for each 
dependent variable in every set, as indicated. 


COMPARISON OF THE DEVELOPED MODELS 


The statistical strength of the models was compared by comparing their corre- 
sponding coefficients of multiple determination. The model sets compared to determine 
improvements by the actions indicated were as follows: 


1. A-U versus W-U, introducing relative accessibility variables to the basic 
IRTADS models; 

2. W-S versus W-U, calibrating the models with data stratified according to the 
zone location over the basic IRTADS models; and 

3. A-S versus W-U, introducing both relative accessibility variables and calibrat- 
ing the model with stratified data over the basic IRTADS models. 


The results of the statistical tests of the significance of the increase in R* of each de- 
veloped model over the corresponding basic IRTADS model are given in Table 2. Com- 
parative summary statistics of all developed models are given in Table 3. 


CONCLUSIONS 
Based on the preceding results and analyses, the following conclusions can be drawn: 


1. Among all of the relative accessibility variables considered, the following vari- 
ables were included in the trip-generation models that were developed: accessibility 
to employment in conjunction with home-based work productions and non-home-based 
work attractions; accessibility to labor force in conjunction with home-based work at- 
tractions; accessibility to single-family dwellings in conjunction with home-based shop 
attractions and home-based other attractions; and accessibility to educational floor area 
in conjunction with home-based school productions and non-home-based nonwork at- 
tractions, The preceding accessibilities were each calculated with the friction factor 
corresponding to the same trip purpose as the model under consideration. 

2. Relative accessibility variables in trip-generation models improved the statis- 
tical strength of models of person-trip attractions more than that of models of person- 
trip productions. Competition is a more important locational consideration for high- 
attraction zones, which indicates their need for greater accessibility. 

3. Calibrating trip-generation models with data stratified according to the location 
of the zone in the central or noncentral areas always improved the statistical strength 


Figure 3. Trip-generation models developed. 
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Table 2. Results of comparisons of sets of models. 


A-U W-s A-58 A-U w-s A-S 
Trip Versus Versus’ Versus Trip Versus Versus Versus 
Purpose W-U W-U W-U Purpose W-U W-U W-U 
1 N Ss * 8 N N 
2 * * + 9 ++ Ss 
3 8 s s 10 8s Ss 
4 * + * 11 acl 5 
5 bs s ** 12 + S 
6 + s ¥* 13 + S 
7 * s ** 


Note: N = increase in R? is not significant at a = 0.0005, S = increase in R? is significant at a = 0.0005, * = no 
satisfactory models were developed, and ** = models were developed but no statistical testing was possible. 


Table 3. Comparative statistics for all sets of models. 


Coefficient of Multiple Number of Independent 
Trip Determination Coefficient of Variation (percent) Variables in Model 
Pur- —_-_ ee —_—_—_—__ 
pose Ww-U W-s A-U A-S Ww-U W-S A-U A-S W-U W-S A-U A-S 
1 0.974 0.975 0.977 * 16.30 16.00 15.52 * 1 2 3 ¥ 
2 0.887 * * « 38.38 ~ * 33 2 * * * 
3 0.630 0.659 0.670 0.699 80.28 77.14 75.95 72.66 7. 2 2 3 
4 0.903 of * * 31.99 * * ae * = > 
5 0.748 80.767 0.748 0,790 53.83 51.89 53.89 49,41 4 ff 5 9 
6 0.650 0.806 0.643 0.797 70,43 52.69 71.10 53.91 5 8 5 9 
7 0.961 0.965 0.958 0,962 17.89 17.15 18.61 17.83 4 6 4 6 
8 0.839 0.840 0.840 0.842 54.93 54.74 54.72 54.54 1 2 2 3 
9 0.442 0.705 0.553 0.639 158.64 115.63 141.99 127.92 2 4 2 4 
10 0.679 0.716 0.682 0.719 53.05 50.04 52.90 49,94 5 i 6 9 
11 0.738 0.757 0.739 0.780 54.01 52,22 54.15 49.84 4 8 7 10 
12 0.636 0.799 0.645 0.801 73.67 55.05 72.90 54.70 5 8 6 7 
13 5 8 5 9 


0.771 0.844 0.801 0.848 46.68 38.71 43.58 38.27 


Note: * = no satisfactory model developed. Degrees of freedom ranged from 382 to 392. 


Figure 4. Noncentral area zones where basic IRTADS models 
underforecast trips. 
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Figure 5, Proposed trip-generation process, 
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of the models whether the models had accessibility variables or not. Models of home- 
based work person-trip productions or attractions were improved least by including 
relative accessibility variables or stratification or both. This is expected because 
work trips are inelastic to trip length because of their regularity and necessity. Fur- 
thermore, it indicates substantially similar attracting characteristics for work trips 
by zones in the central and noncentral areas. 

4. In general, the statistical strength of the developed models was better achieved 
by stratification alone than by including relative accessibility variables only. 


The 4 sets of developed models were solved with the 1985 forecast values of the in- 
dependent variables. The forecasts were analyzed to identify comparative forecasting 
trends of the different models. The following conclusions were drawn. 


5. It was observed that stratified models consistently forecast more trip productions 
and attractions for zones of the noncentral area and fewer for zones of the central area 
than models without stratification. Stratified models are thus sensitive to the situation 
of equilibrium and saturation being reached in the central area and also the faster rate 
of traffic growth in the noncentral area. 

6. 1985 forecasts of person-trip productions and attractions by models that had 
relative accessibility variables and that were calibrated with stratified data were sig- 
nificantly different from forecasts by basic IRTADS models. There was not a detectable 
trend as to the sign of the mean difference between zones of the central and noncentral 
areas. Further analysis indicated that stratified models with relative accessibility 
variables forecast more productions and attractions than were forecast by basic IRTADS 
models, in general, for zones located in the vicinity and along corridors defined by the 
major thoroughfares of the study area. This reflects a possible locational aspect of 
trip generation in addition to the central-noncentral stratification. This is shown in 
Figure 4. 


The trip-generation models proposed by this research are functions of the status of 
the transportation system. In an operational transportation study, future forecasts of 
trip generation would then be affected by the nature of the proposed transportation net- 
work. Because the proposed network should be designed to serve future trip genera- 
tion, an iterative process should be followed. It would be terminated when an equilib- 
rium between the future supply of transportation (proposed plan) and the demand for 
transportation (travel forecast) is reached. This iterative process is shownin Figure 5. 
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COMPARISON OF PROBABILISTIC MODAL-CHOICE MODELS: 
ESTIMATION METHODS AND SYSTEM INPUTS 


Antti Talvitie, School of Civil Engineering and Environmental Science, 
University of Oklahoma 


Twelve models were formulated by segmenting the total travel time and to- 
tal travel cost by rapid transit and by automobile in different ways or by 
leaving them out completely and including only socioeconomic variables in 
the model. These models were then estimated by using logit, probit, and 
discriminant analyses. The results were evaluated in 2 respects: Are 
there differences in performance among the methods of estimation? and 
Are there differences in performance among the 12 model specifications ? 
The results indicate that there are no statistically significant differences 
either among the methods of estimation or among the model specifications 
themselves. A model that uses only 2 user characteristics, income and 
the number of working household members, and 1 system-related variable, 
a dummy variable for walk access to the transit station, performs no worse 
than a model that uses a whole set of system characteristics in addition to 
those 3 variables. Values of time significantly lower than those previously 
reported were found; the "best"! estimate in this study is only 12 percent of 
the wage rate. 


e THERE ARE 3 objectives in this research: (a) to investigate the relative merits of 
the 3 methods most widely used in probabilistic modal-choice modeling or in value of 
time studies, logit, probit, and discriminant analyses (1, 2, 3, 4); (b) to investigate 
the need for trip segmentation, that is, Is there a need to differentiate among access, 
egress, and line-haul times and costs? and if so, How should this trip segmentation 
be done ?; and (c) to obtain further evidence on the value of time. 

Answers to the first objective are provided by estimating several different models by 
all 3 methods and then by assessing and comparing the forecasting accuracy of each 
method. For the second objective, these different models were designed by segmenting 
the travel costs and travel times in different ways. The forecasting accuracy of each 
model specification was then assessed and compared with the forecasting accuracy of 
the other model specifications. The third objective was accomplished as a by-product 
from several different models estimated by objectives 1 and 2. 


BASIC MODEL AND METHODS OF ESTIMATION 


A general probabilistic travel demand model can be expressed as 
Pr (ij, M) = Pr (ij) * Pr (M|ij) (1) 


where 


Pr (ij, M) = probability that the event (ij, M) occurs, that is, an individual makes a 
trip between points i and j by using mode M; 
Pr (ij) = probability that an individual makes a trip between points i and j; and 
Pr (M\ij) = probability that an individual uses mode M, given that he makes a trip 
between points i and j. 


Clearly, in modal choice, models are estimated for Pr(M|ij). In this research only 
binary choice, automobile versus rapid transit, is considered. 


1il1 
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It is assumed that there is an index y that determines to which group (automobile or 
transit) an individual is likely to belong. This index y is constructed as follows: 


y =a+b (Lf) - Li) +¢ (SE) 


where 


constant term; 
(1 x k) vector, where k is the number of system characteristics; 
(1 x £) vector, where £ is the number of user characteristics; 
(k x 1) vector of system characteristics describing the level of service 
between points i and j by mode M; 
SE = (4 x 1) vector of user characteristics; and 
a, b, c = coefficients to be estimated. 


Fe ome) 
nouwou i 


Verbally expressed, it is hypothesized that modal differences in the level of service 
(i.e., differences in travel times) combined with the user characteristics are the de- 
terminants of choice of mode. [Also the ratios of system characteristics can be used 
(1, 5, for example).] 

In the logit model, the probability that choice of mode M, the dependent variable, 
will equal 1 (automobile choice is denoted by 1, and transit ‘choice is denoted by 0) is 


Pr(M = 1|ij) = e’/(1 + e”) (4) 
and similarly 
Pr(M = Ojij) = 1 - Pr(M = 1|ij) =1/(1 + e”) 
No assumptions are needed about the distributions of the variables or of y. 
The probit model uses the same linear function y. If, for any given individual, 


Y2Yortt, then M =1; andify < yer, then M=0. The assumption is made that yori: 
is normally distributed over the population. The probability that M will equal 1 is 


y 2 
Pr(M = 1|ij) = Pr(yerte = y|ij) = 1A 2m eA at (5) 
-cC 
and similarly 
Pr(M = Ojij) = 1 - Pr(M = 1|ij) = 1// 2 i wh dt 


The method of maximum likelihood is used to obtain estimates for a, b, and c in both 
logit and probit analysis. (A computer program originally written by John Gragg, De- 
partment of Economics, University of British Columbia, and modified by Peter Stopher 
for CDC 6400 was used in estimating the models.) 

In discriminant analysis, no dependent variable exists. The objective is to findsuch 
linear combination of the explanatory variables that their joint distribution, the distri- 
bution of y, for the 2 groups would possess very little overlap. This discrimination 
rule classifies an observation at y as coming from population 1 if f,(y)/f(y) >k, and 
otherwise from population 0. If we assume that y is normally distributed, as is con- 
ventionally done, then, after some serious manipulation, 


Pr(M = 1ij) = e!"(9/) /[4 + er i(/a)y (6) 
and 
Pr(M = O|ij) = 1/[1 + e"(P/4)] 


where p and q are the a priori probabilities of group membership 1 and 0 respectively. 
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DATA 
Data Source 


The data used were originally prepared by Lisco (2) and are well documented in his 
thesis; only a couple of comments are in order here. 

The data consist of 159 work trips made during the morning rush hour from Skokie 
to the Chicago Loop. Only a binary choice (automobile or rapid transit) was available 
to the trip-makers. The corresponding travel times and costs were compiled for each 
segment of an individual trip; these disaggregate figures formed the basis for the trip 
segmentation. 


Trip Segmentation 


The system shown in Figure 1 will help explain the trip segmentation procedures. 
(The actual system was more complicated because of train transfers needed by some 
of the travelers. This does not change the principles, however.) Links 1 through 7 
describe the transit network, and links 8 through 10 describe the automobile network. 
Times and costs associated with each link were obtained for each element in the sam- 
ple. These transportation system attributes are given in Table 1. 

Three user attributes—income, number of workers in household, and automobile 
ownership—and 1 indirect system attribute—walk access to rapid transit station—were 
also included in the data. These socioeconomic attributes are given in Table 2. 

Twelve models were estimated by combining the travel time and travel cost differ- 
ences in different ways. Of the socioeconomic attributes, dummy 1 and dummy 2 are 
included in every model; but if income was used, then automobile ownership was not, 
and vice versa. 

A description of the models is given in Table 3. Next to the description column, a 
relationship is given to indicate how the time and cost differences were combined. Ac- 
cess refers to the trip from home to station, egress refers to the trip from station to 
work, and total access means access and egress taken together. Excess refers to the 
time spent outside the vehicle, for either walking, waiting, or transferring. Travel 
time differences are transit minus automobile, and travel cost differences are automo- 
bile minus transit. Exceptfor the socioeconomic variables, given in Table 2, model 7is 
the same as the one used by Quarmby (3), model 8 is the one used by Lisco (2), and 
model 9 was used by TRC (6, the TRC model used ratios instead of difference of re- 
spective travel times and costs). The system variables in these models were excess 
time (not in Lisco's model), total time, and total cost or out-of-pocket cost (TRC model). 


RESULTS 


Two kinds of evaluations were made on the basis of the results: (a) What is the best 
method? and (b) What is the best model? Ideally, the evaluation of the methods andthe 
models should be done with a set of data other than that used for the model calibration. 
However, in the present study the original body of data was already quite small (159 
individuals), and splitting that would only have left too few either for the model estima- 
tion or for the control group. An alternative procedure was adopted. It involved taking 
random samples from the data, computing the corresponding probabilities for each in- 
dividual, summing them up to the expected value, and comparing the actual and expected 
values. By taking enough samples and getting the expected values and their standard 
deviations for each method and model, one could perform statistical tests (t-tests) to 
see whether there are any differences among the 3 methods or among the 12 models. 

Twenty samples of 20 individuals were drawn, and the t-tests were undertaken. Un- 
fortunately, no differences either among the methods or among the models were detected 
this way. The hypothesis that the results are statistically equal could not thus be re- 
jected. 

It was, therefore, decided to engineer the answer to these 2 questions. First, the 
methods were checked for dominance. No model could be excluded because of dominance 
(based on all 3 methods). The decision was then made to rank the methods and the 
models. This ranking was based on multiple criteria. The ranking criteria were di- 


Figure 1. Transportation system between Skokie and Chicago Loop. 
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Table 1. Time and cost variables of transportation system. 
Notation 
Link Description Time Cost Note 

1 Walk to car or bus Tl = 

2 Wait for bus T2 - If car aceess to station, T2 =0 

3 Take car or bus to station T3 c3 

4 Walk to station platform T4 - If walk access to station, Tl = T2 = T3 = C3=0 

5 Wait for train TS 

6 Line haul in train T6 cé C6 = train fare 

7 Egress to work T7 - Walk was only egress mode 

8 Walk to car Ts - 

9 Drive to parking lotin Loop T9 ca T9 includes parking time; C9 excludes parking cost 
10 Egress to work T10 C10 Walk was only egress mode; C10 = parking cost 
Table 2. Socioeconomic variables. 

Notu- Models Used 
Description tion in 
Income (household or unrelated individual) 1 1 through 9 
Workers per household (dummy 1, 0 if 1 and 1 if otherwise) D1 All 
Walk access to station (dummy 2, 0 if distance is 5 miles 
and 1 if otherwise) D2 Al’ 

Automobile ownership A 10 through 12 
7One-hall mile was data-supported walk access distance to station 
Table 3. Description of models. 

Time Cost 
Model Description Relationship" Description Relationship” 
1 Access Tl+ T2+T3+ T4+ T5- TS Access -C3 
Line-haul T6- T9 Line-haul c3 - C6 
Egress TT -T10 Egress C10 
2 Access Same as model 1 Total access c10- C3 
Line-haul Same as model 1 Line-haul Same as model 1 
Egress Same as model 1 
3 Total access T1+ T2+T4+7T5+ TT-T8-T10 Total access Same as model 2 
Line-haul Same as model 1 Line-haul Same as model 1 
4 Excess T1+T2+ T4+ 1T5- Té- T10 Total access Same as model 2 
In-vehicle 
access T3 Line-haul Same as model 1 
Line-haul Same as madel 1 
5 Excess Same as model 4 Total access Same as model 2 
In-vehicle T3+ T6- T9 Line-haul Same as model J 
6 Excess Same as model 4 Total access Same as model 2 
7 10 
Total Do Pe Dy Pi Line-haul Same as model 1 
i=l i-8 
7 Excess Same as model 4 Total c9 + C10- C3 - C6 
Total Same as model 6 
Total Same as model 6 Total Same as molel 7 
9 Excess Same as model 4 Qut-of-pocket C10 - C6 - cor 
Total Same as model 6 
10 None None 
ll None Parking c10 
12 Excess Same as model 4 Parking C10 


“See Table 3. YET = Bus fare to station oF parking cost at the station, no driving costs, 
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vided into 2 categories: classification criteria and criteria based on expected values. 
The classification criteria included 3 items: 


1. Misclassified automobile users, 
2. Misclassified transit users, and 
3. Proportion classified correctly. 


A group membership probability of >0.5 was used as a rule for correct group classifica- 
tion, and the correctly classified proportion was obtained based on these classification 
results. The classification criteria are given in Tables 4, 5, and 6 for the logit, probit, 
and discriminant methods respectively. 

The expected-value criteria were computed based on the results of the 20 random 
samples and included the following: 


1. Average absolute error, which = [(actual number of automobile users - expected | 
value of automobile users) /20]; 

2. Percent error, which = (1/20) [(actual number of automobile users - expected 
value of automobile users)/actual number of automobile users]; 

3. Standard deviation of error, and 

4, Rank sum of the 20 predictions. [This item was obtained by ranking the results 
of 20 sample predictions (in case of a tie, the rank average was assigned for each tied 
value) and summing the ranks for each model. The lower the rank sum is, the better 
the model is.] 


These results are given separately for each method in Tables 4, 5, and 6. 

In the discriminant analysis method, 2 values have been given for classificatory 
measures. The upper ones were derived by using sample proportions as the a priori 
probabilities and the lower ones by using 0.5 as the a priori probability (Eq. 6). The 
latter a priori probability produces better classification results. The same does not 
hold true for the expected-value statistics (Tables 4, 5, and 6), however; but the sam- 
ple proportions as a priori probabilities now give better results. In the ranking anal- 
yses reported below, the classification rankings correspond to those values obtained 
with 0.5 a priori probability. 


Ranking Analysis of Estimation Methods 


The methods were ranked separately by using the classification criteria and the 
expected-value criteria. Each column for each model was assigned a rank of 1 to 3. 
The best method was assigned the rank of 1. In case of a tie, the average of a rank 
sum was assigned for each tie. The ranks were then summed by column and row to 
yield a rank sum for each method. The inspection of the statistical measures given in 
Tables 4, 5, and 6 indicates that discriminant function always produces the lowest stan- 
dard error. An experiment with another data set proved this is not always true. Hence, 
this criterion, expected-value criterion 3, was droppedfrom the ranking of the methods. 
The priority ranking of the 3 estimation methods is given in Table 7. The results of 
the ranking analysis indicate that the logit analysis is the best method. Examination of 
data given in Tables 4, 5, and6 reveals, however, that the differences between logit and 
probit analyses are very small. In any case, logit analysis seems to slightly edge both 
probit and discriminant analyses. 


Ranking Analysis of Model Specifications 


A similar ranking as undertaken for the methods of estimation was performed for the 
12 model specifications. In addition to the classification criteria and criteria based on 
expected values, the number of variables in the model was used in ranking the models 
to input (approximately, of course) the data collection and model estimation costs. 
Ranks run from 1 to 12; no averaging was done for ties, but the lowest tied rank was 
assigned for all ties. The results of this ranking analysis are given in Table 8. 

Three comments may be made on the basis of these ranking results. First, models 
without any explicit system variables appear to be best accorcing to the classification 
criteria. Model 10 and model 11 rank as first and second. According to expected-value 
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Table 4. Ranking criteria for logit 


estimation method. 


Classification 
Criteria Expected-Value Criteria 
Model 1 2 3 1 2 3 4 
1 20 16 0.774 1.04 0.3 0.88 142.5 
2 20 17 0.767 1.02 0.5 0.90 130 
3 22 18 = «0.748 1.08 0.7 0.94 143 
4 21 4 0.779 0.98 1.1 O93 118 
5 21 14 0.779 0.97 O.7 0.92 117.5 
6 21 14 0.779 0.98 O8 0.93 112 
7 20 14 0.786 0.98 O.7 O.91 116.5 
8 23 4 0.767 1.10 6.8 O98 140.5 
9 22 13 0.779 0.96 1,3 0.97 118.5 
10 21 16 = (0.767 1.07 0.8 1.16 115.5 
11 18 wi 0.779 1.15) O 1,15 144 
12 21 16 (0.767 1.12 0.9 1.24 151 


Table 6. Ranking criteria for discriminant 


estimation method. 


Table 5. Ranking criteria for probit 
estimation method. 


Classification 
Criteria Expected-Value Criteria 
Model 1 2 3 1 2 3 4 
1 22 #16 «400.762 «1.05 0.5 0.87 138.5 
2 21 16 0.767 1.02 0.5 0.89 129 
8 22 18 «600.748 «61.08 «40.7 0.93 141.5 
4 22 14 0.773 0.99 1.1 0.92 118 
5 22 #15 0.767 0.99 O09 O.81 121 
6 22 14 «400.773 «61.00 0.9 0.91 122.5 
7 20 13 0.792 O99 1.1 0.89 119 
8 23 «#14 «60.767 «1.10 09 O97 144 
9 22 13 0.779 0.98 1.6 0.96 119 
10 21 #16 O.767 1.06 O4 1.41 110.5 
1 23° 4#«15 0.762 1.15 14 41.10 149 
12 21 16 0.767 1.11 O1 1.20 148 


Table 7. Ranking of estimation methods. 


Criteria Expected-Value Criteria Rank Sum by Criteria 
Model 1 2 3 1 2 3 4 Expected 
Method Rank Classification Value Total 
1 38 § 0.710 1.09 0.8 0.48 134.5 
12 30 0.736 Logit 1 63.5 64.5 128 
2 37 7 0.729 1.08 0.7 O50 128 Probit 2 72 75.5 147,5 
12 29 0.743 Discriminant 3 80.5 76 156,5 
3 36 7 0.729 1.08 0.5 0.55 119 
13 29 0.736 
4 44 fl 0.691 1.05 0.7 O44 122.5 
13 31 0.724 
5 44 3 0.704 1.06 0.8 0.42 131 
15 29 0.724 
6 44 40.898 «= 1.06 «0,8 «(0.43187 . Be. de 
14 310,717 Table 8. Ranking of model specifications. 
‘‘) 40 6 0.710 1.07 O.5 0.44 132 
16 29 0.724 
8 40 ll 0.680 1.11 6.3 0:59 123.5 Model by Criteria 
13 34 0,704 
b] a7 10 0.704 1.06 0.3 0.52 130.5 Expected 
14 28) (0.736 Rank Classification Value Total 
10 58 1 0.630 1.08 0.7 0.53 124 
21 16 0.767 1 10 7 vs 
11 44 5 0.691 1.10 09 O0<.50 185 2 uu 5, 10 10 
17 18 0.779 3 7 4 4, 5, 10 
12 47 2 0.691 1,08 O09 0.49 146 4 9 9 6 
14 23 0.767 5 5 6 2 
Table 9. Coefficients and standard deviations of model 5. 
Logit Probit Discriminant 
Variable Coel. 8.D, Coef. §.D. Coef. S.D. 
Constant term -0.384 1.722 -0,337 1.040 -0.358 - 
Time 
Out-of-vehicle -0.316 0.114" -0.173 0.0639" -0.114 0,147 
In-vehicle" -0.0206 0.0559 -0.0165 0.0321 -0.0085 0,0037* 
Total 
Cost 
Access” 0.0158 0.0088 0.0087 0.0052 0.006 0,0048 
Line-haul 0,0414 0.0248 06,0249 0.0148 0.0170 0.0129 
Total 
Income* -0.0136 0.0056* -0.0074 0.0033" -0.0056 0,0018" 
Automobile ownership 
2 workers per household 1.910 0.507° 1.087 0.284* 0.693 0.216" 
Walk access 2.176 0.780* 1.277 0.447" 0.711 0.226" 


Significant at -0.05 level. 


© Time differences are transit time minus automobile time in hundreds of seconds, 
“Cost differences are automobile cost minus transit cost in cents, 


°\n thousands of dollars. 


Table 10. Coefficients and standard deviations of model 9. 


Logit Probit Discriminant 

Variable Coef. 8. D. Coef. 8. D. Coei. 8.D. 
Constant term 1.075 1.143 0.584 0,694 0.288 - 
Time 

Out-of-vehicle -0.271 0.141 -0.145 0.0806 -0.103 1,34 

In-vehicle® 

Total -0.0327 0.0552 -0.0236 0.0319 -0.0157  0.0062* 
Cost 

Access” 

Line-haul 

Total 0.0165 0.0088 0.0092 0.0051 -0.0068 0.0034 
Income* -0.0157 0.0054" -0.0087 0.0031" -0.0070 0.0023" 
Automobile ownership 
2 workers per household 1.837 0,501" 1.032 0.278" 0.722 0.226 
Walk access 1.996 0.769" 1,174 0.443* 0,684 0.217 


“Significant at -0.05 level. 

Time differences are transit time minus automobile time in hundreds of seconds. 
"Cost differences are automobile cost minus transit cost in cents. 

“In thousands of dollars. 


Table 11. Coefficients and standard deviations of model 10, Table 12. Means of selected transit system 
variables. 
Logit Probit Discriminant Variable Value* Variable Value* 
Variable Coef. 8. D, Coef. §.D. Coef. §.D. Total travel time 3,624 Total travel cost 58,2 
Out-of-vehicle time 749 Line-haul cost 38.2 
Constant term 2.218 0,573 1.220 0.319 0.725 — In-vehicle time 2,875 Total access cost 20.0 
Time 
iE Se “Time is in seconds, and cost is in cents. 
In-vehicle* 
Total 
Cost 
Access* 
Line-haul 
Total 
Income* 
Automobile ownership ~1.460 0.343° -0.797 0.184" -0.592 0.126" 
2 workers per household 1.660 0.476 0.940 0,268* 0.623 0.194" 
Walk access 1.624 0.663* 0.963 0.376 0.511 0,162* 


“Significant at -0.05 level, 

°Time differences are transit time minus automobile time in hundreds of seconds, 
“Cost differences are automobile cost minus transit cost in cents. 

"in thousands of dollars. 


Table 13. Effect of changes in travel time and cost on transit proportion. 


Automo- Automo- 

Transit bile Transit bile 

Time Time Cost Cost 

Up 30 Down 23. Up 30 Down 23 
Model Percent Percent Percent Percent Remarks 
Quarmby - 9.0 -8.0 Reported by McGillivray 
Wohl-Kraft ~ 7.3 -6.2 -1.6 -2.0 Reported by McGillivray 
McGillivray -7.5 -5.6 Reported by McGillivray 
Warner -6.4 -6.4 Reported by McGillivray 
Lisco -34.6 -B8.1 Change in total travel time and total travel cost 
Model 7 - 8.8 -7.0 Change in total travel time and total travel cost 
Madel 7 -15.0 Change in excess time 


Table 14. Value of time. 


Standard Percent 


Value Error of 
of Time of Value Wage 
Model Method (S/hour) of Time Rate 
7 Logit 0.71 1.17 12,5 
Probit 0.22 1.23 16.0 
Discriminant 0.83 = 14.0 
8 Logit 3.49 3.02 62 
Probit 3.44 2.90 61 
Discriminant 3.60 - 64 
9 Logit 0.62 0.82 11 
Probit 0.71 0.86 12.5 


Discriminant 0,65 11.5 
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criteria, however, models that have explicit system performance variables rank best. 
Model 7 and model 5 rank as the first and second. 

Second, it is interesting that model 10, which has only 1 system variable (a dummy 
for walk access), and user attributes (car ownership, which probably is not system in- 
dependent, and number of workers in the household), seems to be as good as models 5 
and 7. Thus, the decision to buy a second car often signifies car choice, and the deci- 
sion to reside near the transit station signifies transit choice. These choices are, of 
course, conditioned by socioeconomic factors such as the number of family members 
in the labor force and income. The effect of the extent of the public transportation sys- 
tem on modal choice may be negligible at present. The good performance of model 10 
is not of spurious nature, as the actual survey proved (7, p. iii): 


This study indicated that diversion of Loop-bound trips ... is made mainly from other rapid transit 
modes, along with suburban railroads and buses, with only a small number being diverted from au- 
tomobile trips. 


Third, the results of the ranking analysis indicate that model 12, which included only 
those variables that were statistically significantly (at the 0.05 level) different from 0, 
consistently showed poor performance. The customary null hypothesis, b = 0, may not 
be a good one. There seems to be a reason to remember that 0 is a very particular 
coefficient; it implies no relation between the dependent and the explanatory variables. 
Why should a variable be excluded solely because its coefficient, which is the maximum 
likelihood estimate, has a wide standard error? In this study, all the variables were 
originally included because they should have an effect on modal choice and not because 
they were available in the data set. Coefficients and standard deviations are given in 
Tables 9, 10, and 11 for models 5, 7, and 9. 


COMPARISONS WITH OTHER SIMILAR MODELS 


McGillivray (5, p. 40, Table 13) computed changes in modal split for the policies of 
increasing relative travel time by 30 percent and increasing relative travel cost by 30 
percent for 4models. These results are reproduced here along with 3 new results for 
the same policies. The new results are those for Lisco's original model, for model 7 
when the change is in total travel time and total travel cost, and for model 7 when the 
change is in excess time (Tables 12 and 13). 

With regard to a 30 percent increase in transit time, it appears that the 4 results 
reported by McGillivray are largely identical; models developed by Quarmby, Wohl- 
Kraft, McGillivray, and Warner estimate the change in transit proportion to be between 
62 and 9 percent. A much different result is obtained by using Lisco's model; there 
the same increase in transit time would decrease the transit proportion by 34% percent. 
Model 7 estimates the decrease in transit proportion to be nearly 9 percent in response 
to the 30 percent increase in total transit time; a result similar to those reported by 
McGillivray. However, if the change is in excess time, then model 7 indicates a 15 
percent reduction in transit proportion. 

With respect to travelcosts, the results reported by McGillivray indicate that a 30 per- 
cent increase in transit cost (or 23 percent decrease in automobile cost) would decrease 
the transit proportion by 6 to 8 percent. There is an exception: The Wohl-Kraft model 
estimates this change to be about 2 percent. The results obtained by Lisco's model and 
model 7 give support to the 6 to 8 percent figure; the percentages for Lisco's model and 
model 7 are 8 and 7 percent respectively. 

Three comments are in order on the basis of these results. First, model 7, where 
the travel time was broken down into excess and total travel time components, suggesis 
that travelers value excess time differently from in-vehicle time. This result was ob- 
tained also by Quarmby and Kraft and Wohl. These models, Quarmby, Wohl-Kraft, and 
model 7, also estimate an identical magnitude for the change in transit proportion in 
response to a change in total travel time. Second, it is somewhat surprising thatmodels 
by McGillivray and Warner on the one hand and by Lisco on the other estimate such dif- 
ferent responses, even though the models are largely similar and do not include the 
excess-time term. McGillivray and Warner models give results similar to the other 
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models mentioned above, but Lisco's model estimates modal choice tobe much more sen- 
sitive to changes in traveltime. Third, changes in modal splitin response to changes in 
travel cost appear to be equal by all the modal-choice models; however, Wohland Kraft, 
who use adifferent type of model, estimate less sensitivity with respect to travel cost. 


VALUE OF TIME SAVED 


Values of time saved, or more commonly values of time, were computed from model 
7 (excess time, total time, and total cost), from model 8 (total time and total cost), and 
from model 9 (excess time, total time, and out-of-pocket cost). The results, including 
the standard error of the value of time and value of time as a percentage of the wage 
rate, are given in Table 14. 

Value of time, in dollars/hour, was computed as (bi/bz) x 0.36, where bi and bz are 
the estimated coefficients of total travel time difference and total cost difference re- 
spectively. Variance (b:i/bz) is computed from the following formula: 


Var (b:/bz) = 1/b3 [b3 var (bi) + bi var (bz) - 2bibe cov (bib2)] 
+ 1/b3 [bi var (be) + be cov (bibs)] 


It appears from the results that all 3 methods of estimation produce approximately 
the same values of time. Models 7 and 9, which both have excess and total travel time 
variables, with the cost term being total cost difference in the former and out-of-pocket 
cost difference in the latter, obtain largely equal values of time. The average value of 
time for model 7 is 82 cents/hour and for model 9 is 66 cents/hour. These ratios are 
about 14 and 12 percent of the wage rate respectively. 

Model 8, which has total travel time and total travel cost variables but no excess 
time variable, obtains a much higher value of time: approximately $3.50/hour or 62 
percent of the wage rate. (In Lisco's study the time value was 40 percent of the wage 
rate, and in Quarmby's 20 to 35 percent.) 

The standard errors of all value of time estimates are quite large. Data given in 
Table 11 show that the standard errors are about equal to the values of time themselves. 
Two comments may be made on the basis of the results. First, the out-of-vehicle and 
in-vehicle times must have extremely different values because of the large difference 
in value of time depending on whether it was computed from a model where excess time 
is explicitly accounted. This result was also indicated by the results obtained in the 
previous section, where choice of mode was much more sensitive to out-of-vehicle than 
in-vehicle times. From the coefficients of models 4, 5, 6, 7, and 9, it may be inferred 
that out-of-vehicle time is 6 to 15 times the in-vehicle time, most of the values being 
around 7. [Quarmby found the out-of-vehicle time to be 2.5 to 3.0 times the in-vehicle 
time. In Ergtin's recent study the value of walking time was estimated to be between 
$4.50 and $11.50 (8). This result is in general agreement with the findings of this pa- 
per.] Second, in spite of the large standard errors estimated for the value of time in 
this study, it appears that the value of time savings may not be so large as previously 
believed. This concerns especially the in-vehicle time. Therefore, to be realistic 
any economic study of a transportation improvement must consider the out-of-vehicle 
and in-vehicle times separately. 


CONCLUSIONS 
Three major conclusions of this paper are as follows: 


1. The methods of estimation, commonly used in probabilistic modal-choice models, 
probit, logit, and discriminant analyses, all yielded comparable results. Any of them 
can be used with equal success. 

2. At present, the modal-choice behavior of travelers appears to be only marginally 
influenced by the travel times and travel costs. This, in turn, implies that travel ser- 
vice by automobile and transit are not perfectly substitutable services and should, there- 
fore, be modeled separately. 
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3. The values of travel times obtained are substantially lower than those previously 
reported. In particular, the value of out-of-vehicle time is much different from the 
value of in-vehicle time. This fact should be recognized in any economic study of a 
transportation improvement if travel time is given a monetary value. 


REFERENCES 


1. Warner, S. Stochastic Choice of Mode in Urban Travel: A Study in Binary Choice. 
Northwestern Univ. Press, 1967. 

2. Lisco, T. The Value of Commuters' Travel Time: A Study in Urban Transporta- 
tion. Dept. of Economics, Univ. of Chicago, PhD dissertation, 1967. 

3. Quarmby, D. Choice of Travel Mode for the Journey to Work: Some Findings. 
Jour. of Transport Economics and Policy, 1967. 

4. Stopher, P. R. A Probability Model of Travel Mode Choice for the Work Journey. 
Highway Research Record 283, 1969, pp. 57-65. 

5. McGillivray. Classification of Trips by Mode. Dept. of Economics, Univ. of Cali- 
fornia, Berkeley. 

6. Hill, D. M., and Von Cube, H. G. Development of a Model for Forecasting Travel 
Mode Choice in Urban Areas. Highway Research Record 38, 1963, pp. 78-96. 

7. Village of Skokie Mass Transit Demonstration Project. Chicago Transit Authority. 

8. Ergiin, G. Development of a Downtown Parking Model. 


APPROACH TO PROBABILITY DISTRIBUTION OF 


VALUE OF WALKING TIME AND 
PEDESTRIAN CIRCULATION MODELS 


Jean Francois Allouche, Bureau Central d'Etudes pour les Equipments d'Outre-Mer, 
Paris 


eTIME is money. So goes the dictum, but not until engineers began to apply economic 
analysis to transportation plans was there any real concern about how much money it 
was worth. The quantitative concept of a value of time has been used for some time 
but mostly by engineers trying to improve and rationalize manufacturing processes. 
For those analyses, the value of time can be easily taken into account through salaries, 
prices, and interest rates; time saved is equivalent to labor saved or additional pro- 
duction or shorter hours and faster capital turnover. 

But the transportation system moves people—people as consumers much more than 
people as agents of production—and only their behavior can tell what subjective value 
they assign to their own time. [In most U.S. urban areas, for instance, truck trips 
represent about 5 percent of all vehicle trips, and business trips amount to approxi- 
mately 10 percent of person movements. Work trips, preempting leisure time and 
not production time, do not, at least under our present social organization, fall in the 
"production" category (12, p. 81).] This is more than an academic problem; indeed, 
one of the most important quantifiable benefits in highway programs, a part of the 
transportation system, consists of savings in travel time. 

Attempts have been made to measure the value of time, or rather the values of 
time. Factual research as well as psychological inference indicates that our valuation 
of time is influenced by a variety of factors, ranging from the types of activities we 
pursue to our levels of income and the amounts of time being saved. If we restrict 
our discussion to time spent in transportation, the perceived value of time depends on 


time as a constant. Two papers only treated the value of time as a stochastic vari- 
able. Pratt (8), using the central limit theorem approach in statistics, hypothesized 
that the interplay of numerous subjective factors, including value of time and inconve- 
nience cost, would result in a normal distribution of the "catch-all index" by which 
individuals compare 2 travel modes (the index being 0 when a person finds both modes 
equally attractive). St. Clair and Lieder (9), studying a toll versus free highway 
situation, assumed a normal distribution for both the value of time and the inconve- 
nience cost (as measured by the number of speed changes on each alternative route). 
By successive trials, they determined, for the mean and standard deviation of both 
distributions, values that would yield the closest approximation to observed route 
choices. Their approach, however, does not allow for a statistical test of confidence 
of their assumption. A second round of analysis performed by Thomas on the data 
used in his original study (7) of commuter's values aims at defining the value of time 
as a variable function of income and of the amount of time saved, but it is not a proba- 
bility distribution. Apparently, there has been no attempt to determine directly an 
empirical probability distribution for the value of time. 

Also, from a different standpoint, it is remarkable that most researchers have 
confined their investigations to the value of driving or riding time. With the recent 
upturn of interest toward public transit in urban areas and the growing attention given 
to people-mover systems, it becomes equally important to estimate the value of walk- 
ing time. (People-mover systems are very short-haul facilities and can best be de- 
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scribed as distributor systems within major activity centers such as the CBD, shopping 
centers, campuses, and airports.) 

The present paper proposes a method that can be used to determine the actual prob- 
ability distribution of the value of walking time to motorists and has no assumption as 
to its mathematical form. The resulting curve then either can be used directly in 
modeled-choice procedures or can be submitted to statistical tests in order to deter- 
mine the best mathematical approximation for incorporation in a predictive model of 
pedestrian behavior. 


THE MODEL 


The model is based on the behavior of car drivers selecting a parking location be- 
fore they walk to their destinations. In a first-stage approximation, we will assume 
that only parking fee and walking time influence the decision-maker. This eliminates 
factors such as weather, environmental quality, or street gradient. It is legitimate 
to disregard grade if the study area is reasonably level, and, except with shoppers, it 
is unlikely that parking decisions take much account of the environmental quality along 
the path followed to on-foot destination. 


Formulation 


We hypothesize that a driver tends to minimize his total generalized cost (money and 
time). 


C=c+xd (1) 
where 


C = total cost, 

c = parking fee for desired parking duration, 

d = distance from garage to on-foot destination, and 
x = disutility cost of walking 1 unit of distance. 


If a driver selects parking facility 1, parking fee ci, and distance di from his des- 
tination, rather than facilities 2, 3, ..., N, we assume that 


Ci + xdi = c, + xdy for al) | 32/35 .ccc, N (2) 


Three cases are possible: 
1. If d, - d, >0, then Eq. 2 implies that 
x <(c; - c,)/(di - d,) (3) 
2. If di - dj; =0, then Eq. 2 implies that 
Ci=C; 
i.e., if our original assumption of rational trade-offs between time and walking is 


correct, the driver should select the cheaper facility. But this tells nothing about x. 
3. Ifdi-d,<0, then Eq. 2 implies that 


x2 (ey eS c,)/(d, = d;) (4) 


Each parking decision then results in a list of inequalities. The whole set of in- 
equalities imposes on x a lower bound or an upper bound or both. An additional con- 
straint on x is that, inasmuch as the model is valid, no negative value should be 
accepted for the distance disutility cost. This assumption is validated by most people's 
behavior: They like to park close to their destinations. Therefore, in all cases we 
have 


L<x<M (5) 


or 
L<x (6) 
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Graphically, the argument goes as follows (Fig. 1): Total cost for a driver parking 
in facility i and walking to building j is 


fi) = Cy + Xd; (7) 


For a given parker, j is fixed and Eq. 7 can be represented by a straight line in the 
coordinate system x, g. All parking facilities can be so represented. Whatever x, 
the disutility cost to him, the driver will consider only the minimum cost curve 
([PRST (F), Fig. 1]. 

If the driver selects facility C Lline C, Fig. 11, then x, his valuation of distance 
disutility, must belong in the range r to s, for this is where C isthe minimum cost curve. 
Selection of F is the case exemplified by inequality (Eq. 6). It will deserve special 
treatment. Selection of A would mean that the driver has a negative distance disutility 
cost. Selection of E is merely unaccountable by the model in its present form. E is 
never the minimum-cost solution and is, therefore, termed a noncompetitive facility 
under the circumstances. Both of these aberrant cases will be briefly discussed 
hereafter. 

As already mentioned, a negative disutility cost of distance is at odds with the 
empirical observed behavior of an overwhelming majority of drivers. Negative disutil- 
ity cost values or selection of a noncompetitive facility can be interpreted without 
throwing the model away, however. They can arise from a nonuniform distortion in 
distance perception on the part of the driver. If there are only a few occurrences of 
drivers with negative costs, we will just dismiss the datum and tally only the percentage 
of people falling in that category for later use in an assignment model. 

The 2 aberrant cases can also arise from what we might term blurred rather than 
biased perception. People are probably little sensitive to small differences in dis- 
tances and may, therefore, make decisions apparently at odds with the "numbers." 

If negative disutility cost values are observed in significant number, we propose to 
investigate whether it can be traced to some slackness in sensitivity to distance by 
defining a sensitivity threshold, for instance. 


Histogram 


A histogram is a statistical representation describing the number of observations 
falling within a certain range (a to b). These observations are represented by a rec- 
tangle, the area of which is proportional to the number of observations and one side of 
which is the interval (a to b) on the horizontal axis of the graph. 

Figure 2 shows a distribution with 3 groups of observations: In group 1, 10 obser- 
vations are between 0 and 5; in group 2, 20 observations are between 5 and 10; and in 
group 3, 20 observations are between 10 and 20. Group 3, being spread over an inter- 
val twice as large as that of group 2, is assigned an ordinate twice as low. 

The limit of a distribution's histogram is a probability density function when inter- 
vals multiply ad infinitum and their width tends toward 0. In other words, the histogram 
constitutes an approximation to the probability density function after the area under its 
perimeter has been normalized to 1 (by rescaling ordinates), 

Such a histogram could be constructed step by step from the inequalities (Eq. 5) 
attached to each surveyed parker. Each parker is considered as being one observa- 
tion, and this procedure does not affect the final configuration of the histogram. 
Parkers with the same parking location and the same on-foot destination could be con- 
sidered as a group at this stage, but later considerations will call for individual pro- 
cessing. The range for the group is assumed to be the interval defined by the set of 
inequalities. If, as probable, the data come from a sample survey, each observed 
parker has to be weighted with the appropriate factor to expand sampled data to the 
whole population of parkers. 

The problem of parkers with an upper unbounded disutility cost of walking (inequality, 
Eq. 6) can be dealt with in 2 ways. 


1, By estimating an absolute highest disutility cost based on the highest upper 
bounds observed among other parkers and based also on common sense rationales, 


walking 
x = disutility 
cost 


Figure 2. 
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e.g., a value that would make a taxi ride a preferable alternative; and 

2. By calibrating a value of the upper bound that, in turn, would produce a best 
fit when the histogram is used for predictive purposes (this will be developed later in 
this paper). 


Distribution of the Value of Time 


In the preceding section we have proposed a method to build the distribution of the 
disutility cost of walking based on distance. An assessment of walking speed is nec- 
essary to derive the distribution for the value of time. 

Existing studies (11) show that pedestrian travel speeds (averaged over a complete 
portal-to-portal walk because speeds can vary significantly from block to block during 
the same walk) can be considered normally distributed, with significantly different 
means for men (4.93 ft/sec) and women (4.53 ft/sec). Standard deviations are similar 
(approximately 45 percent of the value of the mean). Here again, the problem can be 
approached in several ways: 


1. The speed is assumed to be constant and equal to the mean of the observed dis- 
tributions for men and women separately, if this information is reported in the survey; 

2. Walking speed is considered to be a normally distributed random variable with 
known mean and standard deviation and is assumed to be statistically independent of 
walking disutility cost; and 

3. Walking speed is considered to be linearly correlated in a positive way with 
distance disutility cost (the rationale on which this assumption is based is developed in 
the brief discussion hereafter). 


Regarding the first approach (constant average speed), once a unique speed has been 
set (possible one for each sex and purpose), multiplying the horizontal scale by that 
speed factor will transform dollars/ft into dollars/min, thereby yielding the distribu- 
tion of the value of time. Some studies have indicated a variation of walking speed 
with time of day, but this seems to be related primarily to trip purpose. If warranted, 
the model can be applied separately to each trip purpose. 

Regarding the second approach, we briefly discuss the assumption of independence 
between walking speed and distance disutility cost, on which it is predicated. A rapid 
and lively pace is often correlated with a certain liking for walking. It seems logical 
to assume that a great liking for walking is linked to a lower distance disutility cost. 
Conversely, a slow walking speed would be associated more often with a dislike for 
walking and a high distance disutility cost. The assumption of independence appears 
at best as a convenient simplification, pending a careful and much-needed test of its 
validity. If the kind of statistical correlation depicted above does exist but has to be 
overlooked for convenience reasons, the distribution of time value derived under the 
assumption of statistical independence will be flatter and more widespread than the 
"true" distribution. The computations required to develop the distribution under the 
assumption of statistical independence is described in the Appendix. 

The rationale in favor of a correlation (i.e., the third approach) is developed in the 
preceding paragraph. If the first and second approaches yield unsatisfactory results, 
it is possible to single out some of the observed parkers and track them later to mea- 
sure their average walking speed. (In a later section, tests will be suggested to eval- 
uate the reliability of the distribution produced.) Conditional distributions of distance 
disutility cost could be constructed for people with given walking speeds, and their 
correlation with speed analyzed. This is a long and costly operation, but there is 
good reason to think that, if such a correlation does exist, it depends primarily on the 
culture or the distribution of temperamental features among the population and therefore 
is rather constant from place to place, at least within a culturally homogeneous domain 
such as the whole of North American cities. Several definitions for stability are sug- 
gested in the Appendix. At any rate, once this tedious investigation is performed, the 
student of different cities could dispense with this special kind of tracking survey and 
use only standard parking surveys for data gathering. 
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PRELIMINARY EVALUATION OF THE MODEL 


Possibly the utmost advantage of the approach outlined in this paper is the very low 
cost of data collection. Practically all of the information required is available from 
standard parking surveys conducted at intervals in major American city center areas. 
Minor changes such as additional questions dealing with sex, age, or income could be 
accommodated at practically no cost if one wanted a stratified application of the model. 
Data processing would be slightly more sophisticated than for a standard parking study, 
but the basic program for histogram building is neither sophisticated nor very long and 
can be used again in different studies. 

Many situations in which walking can be traded off for money in a downtown environ- 
ment are of a multiple-choice nature. More than 2 modes are often available. More 
than 2 routes can easily be envisioned. The parking decision on which this paper is 
predicated also involves multiple competition as soon as there are more than 2 parking 
facilities open to the public. This multiple-choice character inherent to the problem 
thus precludes the use of discriminant analysis. On the other hand, competition among 
many garages is beneficial to our approach; the more facilities there are, the better 
our chance is to define a narrow interval containing the driver's (unknown) value of 
time. 

The model proposed here has conceptually more explanatory power than models 
based on discriminant analysis, although the latter may produce relevant predictions 
at the aggregate level. A discriminant analysis type of model 'does not specifically 
estimate the route choice for the individual motorist; rather, it predicts the action 
expected of an 'average' motorist when faced with the given route choice situation"! (7 
p. 59, Thomas comments about his own model only, but the statement can be applied 
to any model of the discriminant analysis type). On the other hand, if supplied with a 
decision-maker's actual value of time, our model is capable of predicting that person's 
decision. In our model, probabilities do not reflect people's indeterminacy or ''unpre- 
dictability" (i.e., people completely and identically defined in terms of Thomas' model 
formulation have to be arbitrarily assigned opposite decisions in order for his model 
to work satisfactorily) but rather uncertainty on our part as to their (well-defined) 
time valuation or other similar parameters. 

Using a parking fee as the dollar element of our "total cost'' function eliminates the 
trap of "perceived costs" in which so many previous studies have fallen when, for 
instance, car-operating costs are used. 

We now turn to the shortcomings or difficulties we see involved in the model. 

The first problem involves time perceived versus time actually spent (or, for that 
matter, distance perceived versus actual distance), All models share this problem. 
However, if the decision-maker systematically overestimates or underestimates all 
actual time durations by the same factor, this merely amounts to scaling down or up 
by that factor the value of time computed from actual duration. The revised value is 
the person's value for each actual unit of time and can be used sensibly for evaluation 
purposes. In that case, adding a question about perceived walking time in the parking 
survey should take care of the problem inasmuch as people are sensitive to short-time 
durations and capable of estimating them within a reasonably narrow range. (We refer 
here not to people's biases but to their own indeterminacy. It is a case of "blurred" 
perception.) This procedure would eliminate the need for walking speed analysis, as 
developed above. Both could be later synthesized in an improved version of the model, 
as will be discussed later. Perception biases that depend on circumstances cannot be 
integrated systematically in the model and will be assumed away, together with the 
multitude of other random influences. 

The influence of the weather on the disutility cost of walking (which often takes place 
outdoors) does not need demonstration. Using data gathered on a rainy day would lead 
to the value of time spent carrying an umbrella or dodging raindrops. Although this 
is not uninteresting, it is suggested that the first analysis be conducted on data per- 
taining to moderate weather conditions. Results will be usable for a larger set of 
circumstances, including semicovered malls and indoor facilities. 

It has been argued that short-term and long-term parkers do not have the same value 
of time. Although it is obvious that they do not attach the same value to parking time, it 
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is not evident that their valuations of walking time are different. A discussion in the 
Appendix demonstrates that, as parking duration increases, cheaper and more distant 
facilities become preferable. This establishes the necessity of treating each inter- 
viewed parker separately to assess his exact parking fee, either by direct questioning 
or by using rates and recorded parking duration. Short-term and long-term parkers 
could, of course, be processed separately, if so desired, and separate histograms 
prepared. 

Drivers occupy parking facilities on a first-come, first-served basis. At certain 
times during the day, some facilities are saturated and the driver is faced with only 
a restricted supply. Figure 3 shows how this may bias the estimated range for the 
driver's distance disutility cost to the point where the actual value lies outside the 
estimated range. If all facilities were available, the minimum-cost domain is the line 
PQRTD, and a driver with a disutility cost x, selects facility C. As in the earlier sec- 
tion, we reason that his disutility cost range is b tod. When C is removed from the 
supply, the minimum-cost curve becomes PQRSTD, and the driver selects facility B. 
If we do not know that C is out of the supply, we will interpret his decision as evidence 
that his disutility cost range is a to b, which is erroneous. One could keep track of 
facility saturation by hours of the day in the survey, but it does not seem practical at 
this point. Moreover, saturation is not a stable condition: C may be saturated a 
moment and then become open again as B becomes saturated, and soon. The bias 
tends to smooth out the histogram rather than change its balance. Figure 3 shows that 
range a to b was mistaken for the "true" range b to d. But it is so only because x, is 
inferior to c. Had x, been superior to c, the driver would have selected facility D, 
thereby leading us to mistakenly assume range d to +a instead of the true range b to d. 
The net result is that people who should have been distributed in the b to d interval will 
now be represented on either side of the histogram. 


MODEL APPLICATION 


The probability distribution of the value of time (or walking distance disutility cost) 
can be used in many applications, which can be roughly categorized in 2 groups. 


1. Economic analyses estimating value of time savings accruing to pedestrians 
due to an improvement in pedestrian circulation, such as an overpass, a trail system, 
or a people mover, or evaluating alternate plans for a primarily pedestrian-oriented 
facility, such as a hospital, university campus, or civic center. (In case of a people 
mover, our curve would give a lower limit because the action of walking is considered 
a hardship by many, regardless of the time involved; a people mover mitigates this 
hardship.) 

2. Pedestrian assignment models based on a total-cost-function assignment of 
drivers with known on-foot destinations to parking facilities (enabling a comprehensive 
treatment of parking schemes and pedestrian systems) and an estimation of pedestrian 
traffic diverted to a new facility, such as an overpass, or a people mover. 


We use drivers' assignment to parking facilities as the example because it will help 
explain calibration procedures. Assume an average walking speed determined by the 
driver's physical characteristics (sex and age). Assume also 3 parking facilities A, B, 
and C and a group of drivers who have identical characteristics, have destinations in the 
same building, and are willing to purchase 1 hour of parking. Parking rates, walking 
distance, and time are as follows: 


Parking Walking Walking Total 

Rate Distance Time Cost 
Facility ($ /hour) (ft) (min) Function 
A 0.10 1,250 5 0.10 + 5x 
B 0.30 500 2 0.30 + 2x 


c 0.50 250 1 0.50+x 
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Figure 4 shows the minimum cost curve and the probability density function of x, 
the value of time, as functions of x. People with a value of time lower than a will 
select facility A. People with a value of time greater than b will select C. People 
with a value of time between a and b will select B. Their proportions are represented 
by the areas delimited by the probability density curve P (x), the x axis, and abcissas 
aandb. Knowing P (x), we can compute these proportions and can accordingly assign 
the group of drivers under study to the various facilities. 

In summary, the inputs required for this application are the pattern of final trip des- 
tinations and parking durations; the locations of parking facilities; parking rates for 
each facility; and the probability density function of the disutility cost of walking, or the 
probability density function of the value of time, plus some information about walking 
speeds (average or probability distribution). All inputs but the last are currently 
available. The output is an assignment to parking facilities of drivers with destinations 
in given buildings. 


CALIBRATION 


So far we have mentioned only 1 degree of liberty for the model: the absolute max- 
imum M, the value of time to be used in cases where inequalities (Eqs. 5 and 6) do not 
define an upper bound for x. (This section deals with value of time, but is readily 
applicable to the calibration of a model based on walking disutility cost.) 

It is possible to define an index of performance for the model with respect to which 
parameter M can be adjusted. To that end, we apply the model to a parker-assignment 
problem in a situation where a parking survey with origin-destination data is available. 

Let Y,,; be the number of people observed in the survey walking from parking facil- 
ity i to building j (the pair i-j is called an interchange). Let X,, be the number of 
people assigned by the model to interchange i-j. Ideally X,, = Y,;. If Xi; > Ys), then 
X,, - Yi, people have been erroneously assigned, thus creating an imbalance Kyys < 
Y,,,, in another pair i’-j‘. Thus, 


1=(% Iu - Yul) 22 Ys; 
1) 1) 

is the percentage of people erroneously assigned. (The number of pairs i-j may be 
very large. In some cases, it may be more practical to group interchanges by volume 
categories or into screen lines and apply correlation analysis to the aggregate. ) 

Each value of M will yield a certain value I (M) for the performance index. Trying 
several values of M over a reasonable range will give evidence of a trend for I (M). 

It can be shown that I (M) has an asymptote when M tends to infinity. This in turn will 
enable us to determine a value of M and to maximize I. If the fit between observed and 
synthesized interchange patterns is not satisfactory, another degree of liberty can be 
provided. 

So far we have used 0 as the lower bound for ranges that lack one. But there is 
probably a nonzero minimum value, L, to people's value of time. 

To calibrate the model with respect to M and L at the same time requires that some 
maximizing techniques be borrowed from nonlinear programming (such as steepest 
ascent or reduced gradient) so that the I (M, L) surface can be "climbed" on. They 
generally entail much computer time. Besides, in our problem the computation of 1 
point on the surface already involves a complete assignment run plus a performance 
analysis (the latter is relatively inexpensive). This makes the feasibility of a double 
calibration dependent on the cost of running the basic assignment program. 


POSSIBLE IMPROVEMENTS 


Stratification of the data by trip purpose, sex, age, and income may uncover signif- 
icant variations in the probability distribution of the value of time with respect to 
these factors. Logical considerations suggest the direction, if not the extent, of the 
influence of these factors on the value of time. 

A joint study of walking speed v, time perception bias b, and distance disutility 
cost z would give information about the mutual correlations of these 3 characteristics 
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in a given individual. Correlations between v and z and between b and v could be con- 
sidered as stable among cities (within a given culture, e.g., Northern Europe or 
South America). They depend mainly on cultural, noneconomic features. The math- 
ematics of a model based on such premises is developed in the Appendix. 

Street grade plays a double part in decisions entailing walking. First, it increases 
the actual walking time, and, second, it makes walking more exhausting and thereby 
increases its disutility cost. In a first approximation, we may assume that this in- 
crease Az in disutility cost is directly related to the street gradient g. 


Az = ag if g 20 


Az =0 ifg <0 (walking down a street) (8) 


and total cost for a parker would now be 


C= ke + tz + t(g) Az (9) 


where t(g) is the time effectively spent walking on a street with gradient g, t is total 
walking time, and @ is parking duration. The ideal situation to test this model and 
calibrate the parameter a would be a city with several dense nuclei, one in flat terrain 
and another ina hilly area. A first study conducted in the flat area would provide the 
distribution of z. A second study in the hilly core would determine the probability 
distribution of a, assuming the previously derived z distribution. We are working at 
this time on a model that could determine directly both the z-distribution and the a- 
distribution. 

It is interesting to note that the latter approach, if successfully developed, can be 
applied to any variable teamed with value of time (or distance disutility cost), provided 
that a scale is available for that variable. "Street attractiveness" or "environmental 
quality" in a flat area can be treated within this framework. The quality scale required 
could be based on the variations in walking speed. Hoel (11) has observed that the same 
pedestrian walks at varying speeds in the course of a trip, depending on the type of 
block he is walking along (shop windows, bank, factory, or parking lot). 


CONCLUSIONS 


This paper proposes a model of pedestrian behavior that can be tested in a real 
situation at a minimal cost, if coupled with a standard parking survey for data col- 
lection. 

Most of the paper deals with how to determine the probability density distribution 
of the value of walking time, which is believed to be the central element for a pedes- 
trian behavioral model applicable to a variety of situations (parking location selection, 
utilization of short distance people movers, or evaluation of pedestrian circulation 
improvements). 

Valuable improvements to the model can be introduced, if necessary, by adequately 
designing the parking survey questionnaire. 

The approach proposed allows for an incremental study design, concerned first with 
the value of time and then with other elements of the choice procedure such as street 
gradients or environmental quality. 

Advantages of the model in its basic form are the low cost of data collection; its 
multiple-choice nature, covering a wider range of situations than binary-choice models; 
its "explanatory" orientation in that it proposes a rationale for pedestrian behavior 
instead of a more numerical correlation and remains meaningful at the level of an 
individual decision-maker; and its avoidance of the use of dollar costs that are ill- 
defined in the decision-maker's mind (such as car operating costs). 

Among the model's shortcomings are the problem of time perceived versus actual 
time; the variability of walking speeds; the influence of the weather, which is unac- 
counted for; and the first-come, first-served rule of operation in the situation selected 
to calibrate the model that tends to distort the distribution of the value of time. Some 
of these drawbacks can be mitigated through investigation of particular interrelations 
such as that between walking speed and time perception bias, considered as permanent 
personal characteristics. Others are still beyond the range of analysis. 
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Evaluation criteria are proposed to test the model's reliability. This paper sug- 
gests that a first test be made in a medium-sized city that has a flat and rather uniform 
CBD. Subsequent tests can then be designed, if warranted, depending on the insuffi- 
ciencies evidenced by the first one. This paper has tried to anticipate some of the 
problems and to suggest solutions. 
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APPENDIX 
DERIVATION OF DISTRIBUTION OF VALUE OF TIME 
ASSUMED STATISTICALLY INDEPENDENT OF WALKING SPEED 
Let 
v = walking speed, 
t = walking time, 
6 = walking distance, 
x = value of time, 
z = distance disutility cost, 
p(z) = probability density function of z among the population, 
q(v) = probability density function of v among the population, and 
r(x) = probability density function of x among the population. 


p(z) is assumed known through the histogram method, q(v) is known from prior studies, 
and r(x) is to be determined. 
We have, by definition, 


6 =vt (10) 


Also, the overall cost of walking must be the same, whether computed on the basis of 
time or on the basis of distance. 
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xt = 26 (11) 
Equations 10 and 11 imply that 
x = zv for x, z, v20 (12) 


x is the product of 2 independent stochastic variables, and its distribution is 
Za 
r(x) = es p(z) * q(x/z) * dz (13) 


Z. and Zz, the limits of integration, are functions of the boundaries of the ranges per- 
mitted for z and v, which in turn define the range permitted for x. 


If p(z) and q(v) are step functions, Eq. 13 is changed into Eq. 14. 


N 
r(x) = plas + 2+1)/21* qRx/(z, + 2441)] (2144-21) (14) 
i=1 
where Z1, Ze, --+; Zs; are the abcissas at which p(z) jumps from one step to the next. 


Equation 14 is particularly relevant to the determination of r(x) by approximation, 
when p and q are only empirically defined, and therefore not amenable to theoretical 
calculus. 


DEFINITION OF STABILITY OF THE RELATION BETWEEN WALKING SPEED 
AND DISTRIBUTION OF DISTANCE DISUTILITY COST 


As mentioned earlier, there is the possibility that fast walkers are more walk-loving 
than average and thereby have a low distance disutility cost (or, better said, show a 
distance disutility cost distribution shifted toward the low values). Conversely, slow 
walkers would have a distance disutility cost distribution shifted toward the high values. 
This correlation can be formulated as follows: 


¥ = f(z) (15) 
Oy = g(z) (16) 


where 


Vv = mean value of the average walking speed v, and 
o, = standard deviation of v 


both for a subpopulation of given distance disutility cost z. 

Equations 15 and 16 are (at least theoretically) sufficient to define the distribution 
of v, shown by Hoel (11) to be normal. 

It is hypothesized that the correlation between v and z is stable. One definition of 
stability is to assume that the functions f and g are identical from city to city. But 
this would mean in turn that people with a given walking speed have the same average 
disutility cost in city A and city B. This is not obvious. 

Although, it seems reasonable that people's physical characteristics (exemplified 
here by "walking speed") are distributed in approximately the same way in various 
cities, differences in social and economic conditions may influence distance or time 
valuations by those physically similar people located in different cities. 

For instance, if wages, prices, or dividends were doubled overnight while people's 
preferences stayed unchanged, their distance disutility cost would also have doubled, 
although their walking characteristics would still be the same as before. Equations 
15 and 16 would then read 


v = f(z’) (17) 
Cy = g(z’) (18) 
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where 
z’ = new distance disutility cost = twice old cost = 2z. 
However, for everyone, disutility cost relative to the population's mean would be 
undisturbed by the overnight change. 
z'/mean z’ = 2z/mean (2z) = 2z/2 mean z = z/mean z (19) 


Consequently, one way of expressing the stability of the correlation between v and 
z, regardless of (intercity) differences in socioeconomic conditions, is to replace Eqs. 
15 and 16 with Eqs. 20 and 21, valid in both the before and after situations of the 
example given above. 


¥ = F(Y) (20) 
o, = G(Y) (21) 
where 


Y =2/Z, and 
Z = mean value of z among the city population. 


Taking the functions F and G to be the same in different cities is now a relatively 
safe assumption. 
A different but somewhat similar rationale could lead to ¥ and o, being functions of 
Y = (z - Z)/o,; these functions, like P and G, would then be considered valid for all 
cities. 
IMPACT OF PARKING DURATION ON TIME AND COST TRADE-OFFS 


The total cost C, to a driver selecting parking location i is 


Cy = 6k, + xt, (22) 
where 


k, = hourly rate of facility i, 

t, = walking time from facility i to on-foot destination, 
8 = parking duration, and 

xX = value of time. 


The cost differential between 2 facilities for the given driver is 


AC =Ci- C2=6* Ak+x At (23) 
where 
Ak = ki - ke, and 
At =t, - te. 


Assume that facility 1, which is optimal for duration 6., is competing with facility 
2, which is less expensive (Ak > 0) but more distant (Ax <0). For @,, AC is negative; 
i.e., 


6,* Ak+x *At<0 (24) 
But AC increases when 6 increases, and when 
8 > -(x At/Ak) (25) 


AC becomes positive and facility 2 is the preferred one. 

This shows that cheaper, more distant facilities become preferable when parking 
duration increases, if a constant value of walking time x is used. Of course, the 
marginal value of parking time k, tends to decrease. This is well in accordance with 
observations. 
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DERIVATION OF DISTRIBUTION OF VALUE OF TIME, ASSUMING 
THAT THERE IS A CORRELATION BETWEEN WALKING SPEED AND 
DISTANCE DISUTILITY COST AND A PERSONALIZED 
SYSTEMATIC PERCEPTION BIAS 


Let 


v = walking speed, 

T = actual walking time, 

t = perceived walking time, 

D = actual walking distance, 

6 = perceived walking distance, 

xX = perceived value of time, 

Zz = perceived distance disutility cost, 

Z = actual distance disutility cost, 

b = perception bias factor, 
p(Z) = probability density function of Z among the total population, 
r(x) = probability density function of x among the total population, 

q(v/Z) = probability density function of v among the subpopulation with distance 
disutility cost Z, and 

s(b) = probability density function of b among the total population. 


The perception bias b is assumed to be statistically independent of all other per- 
sonal characteristics z, v, x. The correlation between z and v is expressed through 
q(v/Z), as developed earlier. The distribution p(Z) is known from model application 
(histogram). The distribution s(b) is known from answers to a special question in the 
parking survey or prior studies on perceived time. Basic definitional relationships 
are 


t=b°T (26) 
6=beD (27) 
D=V°T (28) 
xt = zd = ZD (29) 


The model described in this paper enables us to construct the histogram of Z. We 
now want to relate x (value of perceived time) with Z (actual distance disutility cost). 
Equations 26 through 29 lead to 


x=(Z *v)/b (30) 


Therefore, the probability density distributions are related as follows: 


(31) 


As before, bounds can be imposed on v, Z, and b that will restrict the range of x. 
For each (permitted) value of x, the limits of integration are functions of the bounds 
imposed on v, Z, andb. 

As shown earlier Eq. 31 can be transformed to fit the case of step functions, partic- 
ularly relevant to the empirical determination of r(x), 


r(x) = f f v(2) * q(v/Z) + s(Zv/x) + dZ * dv 
Zi Vi 


TRAVEL TO OUTDOOR RECREATION AREAS 
IN KENTUCKY 


John A. Deacon, University of Kentucky; and 
Jerry G. Pigman and Robert C. Deen, Kentucky Department of Highways 


ABRIDGMENT 


®THE PURPOSE of this investigation was to examine the nature of travel to outdoor 
recreation areas in Kentucky. Basic data were obtained by means of a license-plate 
origin-destination survey at 160 sites within 42 major recreation areas in Kentucky. 
These data were supplemented by means of a continuous counting program at 10 sites. 
The O-D survey was conducted on Sundays during the summer of 1970; outdoor recre- 
ation travel and other rural travel typically reach a combined peak on summer Sunday 
afternoons. Overall results indicate that the license-plate O-D survey is a most satis- 
factory way to gather data of the type required, particularly because it enables maxi- 
mum utilization of personnel, does not require voluntary participation of the traveler, 
and allows a large sampling rate. The time selected for the survey, 10 a.m. to 8 p.m. 
on Sundays, proved to be completely acceptable. 

Modeling efforts concentrated on the simulation of distributed travel flows from each 
of 190 origin zones comprising the entire continental United States to each of the 42 
recreation areas. The actual distributed flows, treated as the dependent variable in 
the analysis, were the 10-hour departing vehicular flows on an average summer Sunday. 
Factors were developed to convert the 10-hour flows to peak-hour and 24-hour flows. 
Distributed traffic flows are known to be sensitive to demand for recreation at the 
origin zone, supply of recreation opportunities at the recreation area, price of the 
recreation experience, competition among available sources of supply and demand, and 
various other factors. Primary independent variables that were chosen for analysis 
and that reflect to a sufficiently accurate first approximation the most important of 
these sensitivities are population of the origin zone in thousands, p:, attractiveness 
of the recreation area, a;, and spatial separation in miles between the origin zone and 
the recreation area, d.:,, which is used as a measure of the price of the recreation 
experience. 

The attractiveness of recreation areas of varying types and sizes can be reasonably 
approximated by the number and types of facilities available. The following facilities, 
listed in the order of highest to lowest significance, were identified as having important 
effects on attractiveness and were judged to be essential for encompassing the wide 
range of recreation areas studied: water area, picnic tables, swimming pools, horse- 
back trails, beach, golf, hiking trails, overnight accommodations, and outdoor drama. 
The relative importance of those facility types was evaluated by using regression 
analyses. 

Others who have attempted to simulate distributed recreation travel flows have 
utilized gravity models, opportunity models, system theory models, and single-equation 
models. Because the available literature revealed no distinct preference for any model 
type, efforts of this study were concentrated on various single-equation models evalu- 
ated by regression techniques and on a cross-classification model. The cross- 
classification model was found to be an acceptable means for simulating and predicting 
outdoor recreation travel flows and was decidedly superior to any of the single-equation 
models evaluated. From the cross-classification model, per capita distributed flows 
were found to decrease at a decreasing rate with increasing population, increase at an 
increasing rate with increasing attractiveness, and decrease at a decreasing rate with 
increasing distance. 
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The best single-equation model for simulating distributed flows, V:,;, was of the form 
Vi = kid? pi? as* 


in which the k's are constants. This nonlinear flow equation as well as all others in- 
vestigated had to be evaluated by using nonlinear regression analysis. Linear regres- 
sion analysis using transformed (linearized) equations proved totally unsuitable. 

Other data, in addition to the distributed flows, were also available from the O-D 
survey. The average vehicle occupancy rate was found to be 3.13 persons /vehicle. 
However, rates at specific locations were found to depend on the length of the trip and 
the nature of the recreation area and were smallest for the short trips to predominantly 
day-use facilities. Vehicle classification (percentages of the various vehicle types) was 
also found to depend on both trip length and the nature of the recreation area. 


DEVELOPMENT AND TESTING OF A 
MULTIPATH-ASSIGNMENT TECHNIQUE 


P. M. Dalton and M. D. Harmelink, Ontario Department of Transportation 
and Communications, Downsview 


ABRIDGMENT 


®IN MOST conventional traffic-assignment techniques, the travel between 2 points is 
assigned to the minimum or shortest path between the 2 points. Even in capacity- 
restraint assignment, in which a capacity function or speed-flow relation is used in 
successive assignment runs to modify link speeds for use in the assignment, minimum- 
path assignment is used in each run. A distribution of trips over several paths is 
usually obtained by combining the results of successive assignment runs, 

A multipath-assignment technique has several advantages over the conventional 
minimum-path technique. First, in networks without capacity limitations (unrestrained 
or demand assignments), empirical studies have shown that travel between 2 points 
usually distributes itself over several routes. Further, minor network changes can 
often produce major changes in assigned volumes when minimum-path assignment is 
used. Second, in capacity-restrained assignments, the multipath technique enables 
each assignment to be considered on its own, so that (a) all the volume on a given link 
is assigned at the same speed, and the consequent link volumes are consistent with the 
speed-flow relation for the links, and (b) extreme speed oscillations and unrealistic 
paths produced by them are largely excluded. 

In the method developed and tested, the algorithms used for building the multiple- 
route trees are the same as those used in most conventional minimum-path assignment 
programs, except that a different set of link costs is used to build each tree. Each 
link time is chosen at random from a distribution of 8 values having a mean value equal 
to the specified link cost and a mean deviation specified by the user according to link 
type and cost. When a large number of trees are built, the paths will be divided among 
the feasible routes with the largest number normally on the minimum path and the num- 
bers on alternative paths decreasing as the extra cost involved in using them increases. 
This principle also applies between any pair of nodes along a route, thus making trivial 
alternatives unlikely where the extra cost is high in proportion to the distance covered 
between the points where the alternatives exist. This overcomes a problem with some 
previous multipath techniques that assign to the n shortest routes, which may only be 
minor deviations of the same basic route. 

In its simplest form, the assignment may consist of building just 1 tree for each 
origin zone. Although this permits only 1 path between a specific pair of zones, there 
may still be multiple choices of routes between pairs of nodes because large numbers 
of different zone-to-zone movements may pass through the same nodes. Building 1 
tree per zone may create problems near the origin zone if the volumes assigned be- 
cause of that zone are a major part of the total link volume. This problem was solved, 
without the computer cost of building several complete trees, by introducing a cost 
cordon around each origin zone and by building several "inner trees" inside the cordon 
for each complete "outer tree" outside the cordon. 

Ideally, the probability of a given path between 2 points being chosen should be in- 
dependent of the number of links constituting it, and this is achieved by making the 
mean deviation of the link times within each link class proportional to the square root 
of the specified link times. 

The method of capacity restraint adopted is one of successive assignments and speed 
adjustments, the object of which is to reach an equilibrium point where the volumes 
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assigned to each link in the network are consistent with both the speeds at which they 
were assigned and the link capacities. Speed-flow relations are specified by link class 
and a capacity indicator. After each assignment, the link speeds are adjusted accord- 
ing to the assigned volume-capacity ratio and the speed-flow relations. The program 
includes several options as to the formula used to derive the new link speed. 

The speed adjustment procedure used for the capacity restraint may be 1 of 2 
methods. The first of these is an iterative procedure where the complete trip table 
is reassigned at each stage and all previous assignments are ignored. This method 
has always produced poor results when used with a minimum-path assignment tech- 
nique because it may produce large oscillations in assigned volumes for very small 
speed changes on some links while a similar change on other links has no effect. Multi- 
path techniques should produce better results because they permit trips to be diverted 
from 1 route to another in small increments. 

The second method of capacity restraint is an incremental one where a proportion 
of the trip table, specified by the user, is assigned at each stage and the speed adjust- 
ment is based on the total assigned volume in all the previous increments and the link 
speeds at which the latest assignment was made. The speed adjustments may be made 
(a) according to the total capacity of the link or (b) according to a proportion of it equal 
to the proportion of the trip table already assigned (effectively making all speed adjust- 
ments on the basis of a fully loaded network). (Some planners have called this latter 
technique an iterative method because of the repeated adjustments to the link speeds. 

In this paper, it is called the incremental method and is distinguished from the iterative 
method in which the assignment process is applied more than once, but each assignment 
run is considered complete on its own.) The latter technique is more dynamic and re- 
sponsive to volume buildups at an early stage in the assignment process and is there- 
fore preferred to the former. 

The testing and evaluation of the program were carried out in 2 stages. The first 
series of tests consisted of unrestrained assignments on 3 Ontario networks of differ- 
ent types for comparison with minimum-path assignments. These were a small urban 
network, a rural area network, and a comprehensive regional network from the 1964 
Toronto Area and Region Model Study (TARMS). 

In each case, the first assignment was made on the final networks that were used 
for the base years of the respective studies and were assumed to have been calibrated 
for a reasonable minimum-path assignment. On all 3 networks, the link volumes ob- 
tained by the multipath were closer to observed volumes than those given by the 
minimum-path assignment with a very substantial improvement on the TARMS network. 
Further improvements were obtained with a better calibration for the multipath. 

The second series of tests was designed to evaluate different combinations of capacity- 
restraint and assignment techniques. The network used for this was the metropolitan 
area of the 1969 TARMS network. The methods tested were as follows: 


Minimum-path iterative, 

Multipath iterative, 

Minimum-path incremental, 

Multipath incremental, and 

Multipath iterative followed by multipath incremental. 
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Early tests eliminated method 1 as being unworkable. In methods 2, 3, and 4 it was 
found that the iterative method produced link speeds closer to those observed and much 
higher on the average than the incremental methods; nor were the extreme values of 
link speeds so pronounced. These differences can be explained by the manner in which 
the 2 techniques work and were to be expected. 

The greater differences in link speeds from run to run in the incremental assign- 
ment caused some large detours from the minimum path, raising the total vehicle 
mileage by a considerable amount at each stage. The iterative method was found to 
give a stable average speed very quickly, and the total vehicle-miles assigned also 
changed very little between speed adjustments, 

The comparison of the assigned volumes with observed volumes for 1,378 links where 
counts were available was the principal basis of evaluating the methods. All of the 
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methods produced roughly the same error when compared to the observed volume and, 
because there are many sources of this error other than the assignment technique (net~ 
work representation, trip table, capacity functions, and errors in the counts), the only 
conclusion that could be drawn was that no method was significantly better than any 
other for producing accurate assigned volumes. Method 4 gave marginally better re- 
sults than methods 2 or 3. Method 5 was tested because preliminary tests had indicated 
that better results could be obtained at the initial stages by using a cost function of time 
and distance instead of just time for the tree building. This was found to be true at the 
early stages, but the benefits diminished rapidly in subsequent stages. 

With all the incremental methods it was found that, although the assigned volumes 
came closer to the speed-flow relations as the capacity restraint proceeded, the com- 
parison with observed link volumes started to deteriorate after 3 or 4 speed adjust- 
ments, The fact that the assigned volume appears to be a closer fit to the speed-flow 
relation is artificial because the assigned volume has been obtained at several different 
link speeds that cannot be represented by a single speed-flow point. The value of mak- 
ing a large number of speed adjustments for an incremental assignment in order to 
obtain settlement would seem questionable. 

Some of the conclusions indicated by the results were that the multipath-assignment 
technique can be used to advantage for most types of network with or without capacity- 
restraint assignment. When the multipath technique for capacity restraint is used, an 
iterative method should be adopted to produce realistic link speeds; otherwise, the 
method of capacity restraint used has little effect. With either method of capacity re- 
straint, there is likely to be little benefit gained by making more than 4 or 5 speed 
adjustments. 

The multipath assignment and capacity restraint techniques are contained in 1 pro- 
gram but may be used independently if required. The program is written in FORTRAN 
IV for use with an IBM 360/65 computer and will accept networks with as many as 2,000 
zones, 6,000 nodes, and 14,000 links. 


USE OF SOCIOECONOMIC INDICATORS IN TRIP 
ATTRACTION OF LARGE WORK CENTERS 


George E. Mouchahoir, Urban Transportation Project, Atlanta University; and 
Paul H. Wright, School of Civil Engineering, Georgia Institute of Technology 


®IN trip-generation analysis, transportation planning has been concerned with the estab- 
lishment of a functional relation between trip-end volumes and the land use and socio- 
economic characteristics of the units from which they originate or to which they are 
destined (1). Different land use and socioeconomic characteristics, or indicators, 
have been used in different studies, and in some cases the models obtained were un- 
reliable because of the interdependence of these characteristics (2). On the other hand, 
trip-generation analysis has been determining the functional relations based on rela- 
tively large geographic units (census tracts or traffic zones). These large geographic 
units led to unreliable results in small-area studies such as a central business district 
transportation study (3, 4, 5). 

This paper attempts to analyze quantitatively the interrelations among different land 
use, travel, and socioeconomic indicators and to generate a trip-attraction model for 
large work centers instead of traffic zones. 


METHOD OF STUDY 


Because of the absence of socioeconomic and travel indicators by work centers, it 
was necessary to undertake a survey for the purpose of this study. Twenty work cen- 
ters, having a large number of employees, were selected from the Atlanta metropolitan 
area. These employees numbering about 25,000 were surveyed by mail during the winter 
of 1970. The following information was obtained: age, number of children, occupation 
or profession, education level, number of years at work, home value or rent, lot size, 
distance from home to work, travel time and distance, personal and family incomes, 
and car ownership. Similarly, the following was obtained from the employers: number 
of trip attractions, floor areas, distance from the central business district, and the 
assessed value of the work centers. 

Before employee and employer variables were coded for statistical analysis, it was 
imperative to quantify all variables in a scalable manner (6). All collected variables 
were quantified and coded easily with the exception of occupation or profession and 
education. Education was quantified by using the number of years the person spent 
acquiring an education. Occupation or profession was quantified by using the North- 
Hatt occupational scaling method (’). 

The method of analysis used in this investigation was predicated by the nature of the 
data collected and the objectives of the study. The collected data were statistical in 
nature and are all random variables. This randomness associated with more than 1 
variable, plus the interdependency of the variates brought the problem into the realm 
of multivariate statistics (8). 

Two subfields of multivariate statistical analysis were chosen to achieve the objec- 
tives of this investigation: factor analysis and component analysis (9). Factor analysis 
was used to group the observed variables together in ways that permit one to synthesize 
new entities called factors, or indicators, and to determine the degree of association 
among these variables. These factors are independent one from the other; i.e., they 
are orthogonal vectors and they occur in descending order as far as variances are con- 
cerned. That is to say, the first factor explains the largest portion of the total variance 
of the original variables. 

Component analysis was used to generate a multivariate statistical model relating 
the number of trips attracted to work centers to the socioeconomic and travel factors 
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determined from the factor analysis. This analysis starts by determining statistically 
independent groupings called components. These components are tested for their sig- 
nificance by using multivariate statistical tests, and only the significant components 
were used in the regression operation to generate the multivariate model. 


IDENTIFICATION OF EMPLOYEE AND EMPLOYER INDICATORS 


This analysis consisted of identifying the socioeconomic and travel indicators per- 
tinent to the employee and to the employer. 


Identification of Employee Indicators 


The employee socioeconomic and travel indicators were identified after the employee- 
connected variables were factor-analyzed. This analysis was performed in 2 steps. 
The first step analyzed the interrelation among the employee-related variables of all 
the work centers. This analysis indicated that the 13 variables collapsed into the fol- 
lowing 6 factors: 2 socioeconomic indicators having the variables occupation, education, 
personal and family incomes, age, and years at work strongly interrelated among them- 
selves; 1 travel indicator having the variables travel time and distance strongly as- 
sociated with it; 1 home indicator having the variables home value, rent, and lot size 
strongly associated with it; 1 car-ownership indicator having the number of cars in a 
family related to it; and 1 family size indicator having the number of children related 
to it (10). The second step was to choose 1 variable from each factor given above, to 
determine the averages of these variables by work center, and then to factor-analyze 
them. The variables chosen were number of children, occupation level, home value, 
number of cars, family income, and distance of travel. Their choice was made depend- 
ing on their degree of association with their corresponding factors. The factor analysis 
of the averages of the 6 employee variables of the 20 work centers surveyed showed 
that these variables were grouped into 4 independent factors. The first factor indicates 
that the averages of the variables occupation, home value, and family income were strongly 
interrelated under a single factor. The second factor is a single variable factor having 
the average number of children strongly associated with it. Similarly, the third factor 
is also a 1-variable factor having the average distance of travel strongly related to it. 
Conversely, the fourth factor indicates that the averages of the 2 variables, number of 
cars and family income, are highly interrelated under it. These 4 independent factors 
show that these employee average variables could be represented by the 4 factors, in 
other words, by choosing 1 strongly associated variable from each factor. The chosen 
average variables were number of children, occupation, number of cars, and distance 
of travel. 


Identification of Employer Indicators 


The employer indicators were identified by using factor analysis on the 4 collected 
variables: floor space, distance from the Atlanta central business district, assessed 
value of work center, and number of work trip attractions. This analysis grouped these 
variables into 3 independent factors. The first indicates that the number of work trip 
attractions variable is strongly associated with the floor space variable. The second 
factor is a single-variable factor having the distance from the central business district 
variable. Conversely, the third factor indicates that the 2 variables, number of work 
trip attractions and the assessed value, are interrelated and are strongly associated 
with this factor. 

Because the objective is to generate a relation of the number of work trip attractions 
variables to the significantly related employer and employee variables, it is imperative 
to retain the 2 variables, floor space and assessed value, and to reject the distance from 
central business district variable from later consideration. 
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RELATIONS BETWEEN EMPLOYEE AND EMPLOYER INDICATORS 


When the 2 sets of indicators and the significant variables of the employer and em- 
ployee had been determined, a factor analysis was performed to determine the inter- 
relations between these 2 sets. The 7 variables analyzed were number of children, 
occupation level, number of cars in family, distance of travel, floor space, assessed 
value, and number of work trip attractions. This analysis resulted in the collapse of 
the 7 variables into 5 independent factors. The first factor indicates that the variables 
floor space, assessed value, and number of work trip attractions are strongly inter- 
related under this factor. The second factor shows that the variables occupation, as- 
sessed value, and, to a lesser extent, work trip attractions are associated under a single 
independent factor. The third factor is a single-variable factor and has the number of 
children variable strongly associated with it. Conversely, the fourth factor indicates 
that the variables distance of travel, and, to a lesser extent, the number of work trip 
attractions are strongly associated with it. The fifth factor is a single-variable factor 
and has the number of cars associated with it. Therefore, the number of work trip at- 
tractions variable appears to be associated with factors 1, 2, and 4. So, choosing 1 sig- 
nificant variable from each of these independent factors will determine the independent 
variables to be used in the component regression model relating the number of work 
trips to the employee and employer characteristics. The independent variables chosen 
to relate to the number of work trip attractions variable are floor space, distance of 
travel, and occupation level. 


TRIP ATTRACTIONS MODEL 


The work trip attraction relation with the significant employee and employer variables 
was determined by using component analysis multivariate statistics. This component 
analysis starts by a principal component analysis on the independent variables floor 
space, distance of travel, and occupation. This principal component analysis resulted 
in the grouping of these variables into 3 components. The first component is strongly 
expressed by the variables occupation and distance of travel, and the second one is ex- 
pressed by the floor space variable. Similarly, the third component is expressed by the 
variables occupation and distance of travel. 

This analysis indicates that the variance explained by the first component contains 
the largest amount of variance and the variance explained by the third one contains the 
least amount. The significance of the amount of variance explained by these components 
is determined by using the Bartlett test of significance on each residual. This test in- 
dicates that, when the third component is alone as a residual, the test is not significant 
at 0.1 percent level. Conversely, when the first 2 components are tested at the same 
level, the test is significant. 

Having determined the significant components, the component analysis proceeds by 
using regression analysis on these orthogonal components. This component regression 
analysis generates the following multivariate statistical model: 


y = -240.37 x1+ 163.12 x.+ 2.10 x3 -515.48 


where 


Xi = average occupation level, 

X2 = average distance of travel between home and place of work, 
x3 = floor space, and 

y = number of trips attracted to work centers. 


The model can be expressed in its standardized coefficients form as 


y = -0.142 x1 + 0.338 x2+ 0.530 xs 


where the terms are defined as before. These standardized coefficients correspond to 
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the variables expressed with 0 means and unit variances. These coefficients show that 
the variable floor space contributes the most to the model and the variable occupation 
level contributes the least to the model. 

The F-ratio test of significance performed on this model indicates that the model is 
significant at the 0.001 percent level. Conversely, its coefficient of multiple determina- 
tion is 0.378 implying that only about 38 percent of the variation in the number of trip 
attractions to the work centers is explained by the model. Also, the efficiency of the 
model is about 14 percent, which is relatively low for predictive uses. However, it is 
worth noting that the structure of this model expresses a rational relation among the 
variables involved. The model confirms previous findings on the strong relation be- 
tween the number of work trip attractions and the floor space variables (3). The model 
also shows that the variables average occupation level and average distance of travel 
affect the number of work trip attractions to the work centers. The model implies that 
the work centers that have a great number of work trip attractions are the ones that 
employ a large number of blue-collar workers. Conversely, it suggests that the large 
work centers tend to attract workers from a great distance from the center in order to 
satisfy their large demand of skills. 
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ACCESS TO JOBS AND WILLINGNESS TO TRAVEL 


Charles B. Notess, Civil Engineering Department, 
State University of New York at Buffalo 


ABRIDGMENT 


®DURING the past few years, a number of studies have examined differences in access 
to jobs from different residential areas in the city and by different modes of travel 

(1, 2, 3, 4, 5, 6, 7, 8, 9). Generally these studies compared travel time by car and 
average, traveled as much as 3 times longer door-to-door as the worker who traveled 
by automobile. By so doing, he had access to almost the same number of jobs as his 
neighbor with a car had in a city like Buffalo, New York (12). To some extent, the bus 
rider is excluded from a significant proportion of jobs that are not served by bus lines. 

Studies that examined the performance of bus experiments to improve access to sub- 
urban jobs, in general, concluded that conventional bus service, as a means to transport 
inner-city workers to suburban jobs, was not a reasonable solution. The average cost 
per rider was too high, suburban job locations were too dispersed to be served effi- 
ciently by public transit, and the off-peak demand was very small (7, 8). 

Important factors, not examined in these studies, are the worker's perceptions of 
prospects for stable career development in the suburban job and how perceptions of 
commuting by bus fit into the life style and self image of the worker. A framework 
for analyzing these factors is presented in another report (10). Published studies by 
Wachs and Schofer (9) and by Gustafson et al. (11) have examined attitudinal responses 
to transit characteristics but have not gone beyond to examine aggregate measures of 
willingness to travel. 

This paper summarizes a study in which an aggregate measure of the willingness to 
travel to work was developed by using, as a basis, the friction factor incorporated in the 
gravity model for trip distribution. The willingness of automobile users and bus riders 
were compared for 4 different occupational groups. 


DERIVATION OF A WILLINGNESS MEASURE 


An ideal measure of willingness should be independent of the particular spatial dis- 
tribution of jobs and should be independent of the attractiveness of particular jobs. It 
should be a function of travel time (door-to-door) and mode of travel only. The concept 
of friction factor as used in the gravity model for trip distribution is ideally suited asa 
basis for deriving a willingness measure. 

The gravity model can be viewed as an equation for estimating the travel activity pat- 
terns of persons where trips originate in a zone i and are destined to a zone j; the num- 
ber of person trips from i to j is T,;. The equation is written in the following form: 


n 
Tiyy = Py ana/( 2 an.) (1) 
SE 


5 fics 


where P, is the total number of trips produced in zone i for a particular trip purpose, say, 
work or shopping trips, and A, is the total number of trips attracted to zone j for that 
same purpose. The term F,; is commonly called a friction factor and is defined subse- 
quently. 

The friction factor F,, decreases with increasing travel time between zones i and j. 
For a pair of zones, i-j, F,; may be interpreted as a measure of the decreases in at- 
tractiveness of a zone or of the willingness to travel there as the travel time to that 
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zone increases. In other words, the factor F,, can be interpreted as a measure of the 
strength of the willingness to travel as a function of travel time. 

Analyses that used the following equation to represent the variation of F,, with 
travel time are presented in another report (13): 


F,, = LoB/y(«)] f*°e* (2) 


This equation is the so-called gamma distribution with y(a) equal to (w- 1). The 
2-parameter equation provides an efficient way to describe the friction factor witha 
shape parameter o and a scale parameter B. 


CALIBRATION OF GRAVITY EQUATION 


The gravity equation, Eq. 1, was programmed for calibration on the CDC 6400 com- 
puter at the State University of New York at Buffalo. Origin-destination data from a 
1968 survey of work trips in the black ghetto of Buffalo were used, and the particular 
computer program was developed and converged within 4 iterations to obtain averaged 
F,, values that replicated averages of the observed T,; values to within 2 percent. Three 
hundred destination zones were used for trips from each of 7 origin zones. The aver- 
aged F,, and T,, values represented time averages of all F,, or T,; within 5- or 10-min 
time bands. The 10-min time bands were used for bustrips, and the 5-min bands were 
used for automobile trips (12). 

The most important conclusions obtained from examining the F,, curves were as 
follows: (a) The friction factor curves for bus trips were more erratic than those for 
automobile trips and did not follow the analytical equation as closely as did the auto- 
mobile curves; (b) much of the erratic nature in the bus curves was due to the small 
sample size (fewer than 10 trips for a particular origin zone); and (c) with the aid of 
families of curves for different values of the parameters wand 8, the values a = 1.2 and 
8 = 0.08 were selected as providing a good fit to the friction factor for automobile com- 
muters, whereas a = 1.7 and 8 = 0.04 were obtained for bus riders (4, 12). 


UNITY MEASURE OF WILLINGNESS 


The sampling variability was reduced by having 1 numerical measure of friction 
rather than several points ona curve. Therefore, the friction factor curves were re- 
duced to 1 numerical measure by forming a ratio of average travel time to work by 
workers residing in a zone divided by the ideal travel time to work if there were no at- 
tenuating effect for long trips. This ratio was called the friction index. 

In mathematical terms, the friction index was derived as follows. The average of 
travel time for observed trips from zone i is 


j=n =n 
i = p> Ths X ty ph Ty; 
where 
T,, = observed sample of work trips from zone i to zone j, 
t,; = travel time from zone i to zone j, 


| 


t, = average travel time of workers from zone i, 

A; = total number of workers employed in zone j, and 
n = total number of work zones considered. 

j=n j 

SaAxty |/CD A (4) 

j=l 3 


Ideal ty = 


Friction index = F, /ideal t, (5) 
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Table 1. Comparison of friction indexes for occupational groups. 


Friction Indexes 


Using Manu- Nonmanu- 

Occupational Total facturing facturing 
Mode Group A, Ay A, 
Automobile drivers and riders Male operatives 0.837 0.816 

Male laborers 0.822 0.802 

Male others 0.695 0,646 

Female workers 0.453 0.423 
Bus riders Male operatives 0.662 0.646 

Male laborers 0.743 0.725 

Male others 0,635 0.592 

Female workers 0.587 0,548 


Ratio of automobile to bus indexes Male operatives 1.26 0.646 
Male laborers 1.093 0.725 
Male others 1.093 0,592 
Female workers 0.772 0.548 


Table 2. Comparison of friction indexes. ; a ae 
Typical values for the friction index are 


Automobile Bus given in Table 1 for bus and automobile 
Ratio of Friction Indexes* Users Riders modes of travel for 4 occupational groups. 
Male operatives and male laborers 0.993 0.87 The friction indexes were calculated for 
Male operatives and male others —-1.175 1,019 the job distribution A; (all types of occupa- 
Male laborers and male others 1.182 1.08 


tions grouped together), for a distribution 

of manufactoring jobs only, and for a distri- 

"ae operates ton index base on marufeucea Ay. Male tors, bution of nonmanufacturing jobs derived 

; as the difference between the latter 2 dis- 

tributions. (Information on the approximate 
number of manufacturing jobs in each zone 
was available from New York State Depart- 
ment of Transportation data.) 

Ratios of the friction index for automobile to that for bus are given at the bottom of 
Table 1. Dividing the automobile value by the bus value removed (canceled out) the 
effects of the particular A, distribution on the friction index, and thus it can be shown 
that the ratios of friction index can also be obtained from a ratio of average travel time 
by car to that by bus for workers in a particular occupation (12). The ratios indicated 
that male operatives with a ratio of 1.26 were less inhibited by the friction of space 
when traveling by car than when traveling by bus. Male laborers and other male 
workers did not have so different a response between car and bus users because, for 
these latter 2 groups, the ratios were close to unity, 1.10 and 1.093 respectively. 

Let us assume, for lack of more detailed data, that the job distributions of bus riding 
and automobile riding operatives were best represented by the manufacturing Ay dis- 
tribution and that male laborers and other males using both modes of travel were best 
represented by the total A, distribution. Then the friction indexes are useful for com- 
paring the willingness to travel for these occupational groups. 

Ratios of these friction indexes are given in Table 2and indicate that male operatives 
were less willing to overcome the friction of space than male laborers, for the ratios 
of friction indexes for these 2 occupations were less than 1.0. This conclusion was valid, 
however, only if the A, distributions used, as given in Table 2, were appr opriate. 
Unfortunately no better data were available. 

Similarly, male operatives were more willing to travel than other males, automobile 
users more so than the bus riders, and male laborers more so than other males. 


Male others and female workers = 1.08 


CONCLUSION 


This brief summary shows that it is possible to develop useful measures of willing- 
ness to travel as a measure of travel time only if one has specific data on the spatial 
distribution of jobs for the particular occupational groups under consideration. These 
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indexes effectively separate the attractiveness of particular jobs from the effect of 
travel time to the job. 
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STABILITY OF RECREATIONAL DEMAND MODEL 


D. C. Robinson and W. L. Grecco, Joint Highway Research Project, Purdue University 


The principal objective of this study was to determine the stability of pre- 
viously developed recreational demand models. Models were of the form 
Y =Ae™™ and utilized easily obtainable and predictable variables. The 
study illustrated how the model can be used to predict future attendance 
and traffic volumes. Three parks were used in the study, and data were 
collected in interviews with 25 percent of arriving trips at the park en- 
trances. Almost 12,000 interviews were conducted during the period from 
1967 through 1969. The new reservoir model was developed by nonlinear 
regression analysis utilizing distance, population, and influence of other 
similar facilities. Two equations constituted the prediction model: one for 
when there is no other similar facility closer to a county than the reservoir 
under study, and one for when there is another such facility closer to the 
county than the reservoir under study. A comparison showed that, while 
parameter B remained fairly constant over time, there was an increase in 
parameter A. 


*IN 1936, a national policy on flood control was established by the Congress of the 
United States. This policy provided that the federal government would cooperate with 
states and their political subdivisions on flood control projects; that flood problems 
would be tackled jointly by the U.S. Department of the Army and the U.S. Department 
of Agriculture; that project benefits must exceed project costs; and that projects rec- 
ommended would not be constructed unless specifically authorized by law. Since 1936, 
more than 40 reservoirs have been constructed in the Ohio River Basin alone; 30 more 
are in the planning or construction stage. 

Flood control, irrigation, and hydroelectric power were the 3 purposes originally 
considered in the benefit-cost analysis for justification of the construction of dams and 
their resulting reservoirs. There is, however, an added dividend of flood control dams 
that only in recent years has been recognized and included in the economic analysis. 
This dividend is the recreational lakes that are created by such dams. 

Recreation is now recognized as an important business in this country. A substantial 
portion of the gross national product is derived from recreational pursuits in all areas 
of the nation. 

The development of the future highway network must take into account the traffic- 
generating abilities of a recreational park or reservoir. A recreational facility is of 
little value unless it has adequate access. Recreational highways exhibit such unique 
traffic patterns that it is not enough simply to use techniques that have been found valid 
for the analysis of traffic flows on urban streets and nonrecreational rural highways. 
The multipurpose reservoirs are natural recreational attractions and consequently rec- 
reational traffic generators. It is essential for the full utilization of the recreational 
potential of a reservoir that transportation planning coincide with reservoir develop- 
ment plans. Little factual information is available at present that can be used by plan- 
ners to estimate the recreational demand. 


SCOPE 


The area of water available for recreational purposes within the state of Indiana is 
in the process of being substantially increased under flood control programs of the U.S. 
Army Corps of Engineers. Six multipurpose reservoirs have been completed to date; 
8 more are authorized and many more are planned (Fig. 1). The 6completed reservoirs 
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have added a total of more than 20,000 acres of water at summer levels. The Indiana 
State Department of Natural Resources is responsible for the development and operation 
of recreational facilities at such reservoirs. 

Not until 1967 (1) was any information made available to the Indiana Department of 
Natural Resources for the planning of recreational facilities at reservoirs (1). The 
result of this initial research, in which Cagles Mill and Mansfield reservoirs were 
studied, was a model (referred to as the "previous model") for the prediction of rec- 
reational trips to new reservoir areas in Indiana. The model utilized road distances, 
county population, and influence of other similar facilities as the parameters affecting 
attendance. The technique developed and reported in the literature illustrates how the 
model can be used to predict future attendance and traffic volumes to recreational areas 
at multipurpose reservoirs (2 ). During the initial work, insufficient data were collected 
at Monroe Reservoir (which was then in the process of being developed) to be incorpo- 
rated in the prediction model. 

The rapid growth of recreational travel that is expected in the next decade requires 
that all demand models be under continual surveillance. The scope of this work was to 
check the stability of the previous model. 

The recreation facilities at each of the reservoirs are similar in type; however, the 
amount of facilities varies among the 3 reservoirs. For example, Cagles Mill and 
Mansfield each have 1 beach several hundred feet in length; Monroe has 3 beaches (one 
of which is operated by the U.S. Forest Service). Boat-launching ramps are provided 
at various locations around each reservoir: 5 at Mansfield, 2 at Cagles Mill, and 9 at 
Monroe. Within the main recreational areas at each reservoir are located the camp- 
grounds, beaches, concession stands, boat rentals, picnic areas, hiking trails, and 
bathhouses. In general, each park is well kept by personnel who know and take pride 
in their work. 

Proper utilization of these facilities requires an adequate highway system, ranging 
from local access roads to state highways. The main objective of this research was to 
provide a simplified method for estimating future traffic volumes for new facilities of 
this type. 


PROCEDURE 


Data collection for this continuing study was carried out between June 1967 and 
August 1969 at each of the 3 parks. The primary source of data was a 25 percent in- 
terview survey of vehicular trips arriving at the parks. The 25 percent sample was 
chosen because it was considered adequate for analytical purposes and it did not create 
delays to arriving visitors. During the 3-year period, interviews were performed at 
the following locations within each park: 


Park Location 
Mansfield Main gate; dam and Hollandsburg boat ramps 
Cagles Mill Main gate; Cunot dock boat ramp 
Monroe Paynetown, Fairfax, and Hardin Ridge gate- 


houses; Cutright, Dam, and Moores Creek 
boat ramps 


In the interviews, the driver was asked from which county the trip had originated 
and the purpose of the trip, the interviewer recorded the license number (the prefix of 
which, on Indiana passenger cars, is a code number relating to the county in which the 
car was licensed); the number of adults and children (persons under 12 years of age); 
equipment carried such as a boat, house trailer, or camping trailer; and time of day, 
date, park, and location (main gate or boat ramp). The number of adults and children was 
of greater importance prior to 1967 because the fee charged was dependent on the num- 
ber of adults in each car. However, in 1967 the state introduced a fixed rate for each 
vehicle; and in 1968, an optional season pass was available. 

All the interviews in 1967, 1968, and 1969 were conducted during weekends from 
Friday afternoon to Sunday afternoon during the months of June, July, and August. 
Weekends were assigned at random. In 1967, each park was visited on 3 weekends. 
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In 1968, Mansfield and Cagles Mill were each visited on 3 weekends, and Monroe was 
visited on 4 weekends. In 1969, each park was visited on 4 weekends. 

The general procedure adopted in 1965 and 1966 was maintained during 1967, 1968, 
and 1969. Interviewing took place on Fridays from 2 p.m until 9 p.m., Saturdays from 
9 a.m. until 8 p.m., and Sundays from 9 a.m. until 5 p.m. These hours were selected on 
the basis of a pilot study made at Mansfield in1965. After about 9 p.m. on Fridays and 
before 9 a.m. on any day of the week, few arrivals were noted. The parks were open 
24 hours a day throughout the summer, but interviews were conducted only during the 
stated hours. The park records on attendance showed that on weekends the arrivals 
during the interview period usually accounted for about 90 percent of the total visitors 
on Saturdays and Sundays and for about 75 percent on Fridays. 

During the 3-year period (1967 through 1969), 11,800 samples were collected by the 
interviewers and, of this number, 11,400 were usable. The data obtained from the in- 
terviews were coded for the summation program that was used primarily to determine 
the number of annual trips to each park from each county in Indiana and Illinois and 
from other states. 

It was not an unusual occurrence for a visitor to report multiple purposes when 
asked the reason for visiting a particular reservoir. It is probable that most trips to 
a reservoir are made with more than one purpose in mind. However, in this study, 
only the purposes reported were recorded because these were considered to be the pur- 
poses that inspired the trip. Also, no effect was made to determine whether, in fact, 
the stated purposes were actually accomplished. The trip purposes considered were 
boating, camping, fishing, picnicking, hiking, swimming, looking, and other. 

It was apparent, once county trip totals were determined, that more than 90 percent 
of all trips originated from within 125 miles of a reservoir. Thus, for the purpose of 
this analysis, no counties beyond 125 miles of each reservoir were considered. The 
observed trips per county beyond this range were so sparse as to be insignificant. To 
standardize the trip rate from any particular county required a unit of measure. The 
previous model used trips per 1,000 population; this was adhered to in this phase. The 
official attendance (vehicles) for each year at Mansfield and Cagles Mill was obtained 
from attendance records maintained by the Department of Natural Resources; Monroe 
attendance figures were obtained from the park superintendent at Monroe Reservoir and 
the U.S. Forest Service. 

The official total attendance figure for each reservoir was divided by the appropriate 
total attendance expansion factor, which is the ratio of samples interviewed at boat 
ramps to samples interviewed at main entrances (Table 1). In the case of Monroe, the 
expansion factors were applied only to the official attendance figures of the State Rec- 
reaction Area; the attendance at Hardin Ridge (U.S. Forest Service) Recreation Area 
was included later. The estimated total attendance (vehicles) at each reservoir for 
each year is given in Table 1. 

The observed trips from a county were divided by the appropriate county trip ex- 
pansion factor, which is the proportion of the estimated total park trips that were 
sampled in a year. County trip expansion factors are given in Table 1. 

The Indiana county population estimates for 1967, 1968, and 1969 were linear inter- 
polations of projections developed by the Indiana University, Graduate School of Busi- 
ness (3). The Illinois county population estimates were linear projections of 1960 
census data and U.S. Bureau of Census estimates for 1966 (4). The distance figures 
were developed from the center of each county to the center of each reservoir. Road 
miles of the primary highway system were measured. 

It became apparent, when Illinois and Indiana county trip rates were compared for 
equivalent distances from a reservoir, that Illinois county trip rates were significantly 
lower. It was necessary that a state-line penalty equal to 30 miles be added to all 
Illinois counties. This has the effect of including in the analysis only those Illinois 
counties within 95 miles of a reservoir. 


ANALYSIS 
Model Development 


For each reservoir, a plot of the county trip rates (calculated from 1967, 1968, and 
1969 data) versus distance from the reservoir indicated an exponential relation. This 
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supported Matthias' choice of an exponential model to describe the 1965 and 1966 data. 
It should be added that this result was not entirely unexpected because previous re- 
search (5) showed an exponential relation between trip length and distance. 

The form of the function used by Matthias and subsequently in this research is 


yeAe™ 


where 


Y = annual trips/1,000 population from a county to a reservoir; 
A = Y intercept of nonlinear regression curve; 

B = rate of change of nonlinear regression curve; and 

X = distance from a county to a reservoir, in tens of miles. 


There are 2 approaches by which parameters A and B in the equation may be esti- 
mated. First, it is possible to use the method of least squares after the function is 
transformed into 


In Y =InA - BX 


Second, a nonlinear regression analysis that estimates the parameters in an iterative 
manner may be applied. The first approach assumes that the errors in the transformed 
function are additive, which necessitates that the errors in the original be multiplicative 
(an assumption that has no physical basis). The second approach assumes that additive 
errors are in the original function and, because errors of an additive nature are more 
probable, this method was adopted. (An added benefit from the use of the second ap- 
proach was only apparent later; this was when certain distant counties were found to 
have Otrip rates. A logarithmic transformation would not have been able to deal with 
this situation.) 

The nonlinear regression analysis utilized was NONLIN, a revised version of SHARE 
3094 (6); this was described in some detail by Matthias (1). Basically the program 
finds the estimates of parameters A and B in the function Y = A e** + € by minimizing 


ye’= LW -¥) 


where ¢ is the residual error, and Y is the estimate of Y. It is an iterative technique 
that requires an initial estimate of the parameters A and B. 

The previous model was made up of 2 regression equations. One equation was to be 
used for counties that are closest to the specified reservoir, and the other for counties 
that are closer to one or more other reservoirs than to the specified reservoir. The 
decision was made to arrange the data into 9 subgroups. Six subgroups were for a com- 
bination of Cagles Mill and Mansfield (3 years by closest and intervening categories); 
and 3 subgroups were for Monroe (3 years by 1 group containing all counties). There 
are 2 reasons for isolating Monroe data: (a) It is apparent from the total attendance 
figures that Monroe is still in its initial growth period (in contrast to Cagles Mill and 
Mansfield, which are older reservoirs), and (b) Monroe is a much larger reservoir 
than either of the other two (10,750 acres compared to Cagles Mill's 1,400 and Mans- 
field's 2,100), and is, in a sense, unique because it will remain the largest single body 
of water in the state for many years. The second reason is essentially the reason for 
the closest and intervening county groups being combined for Monroe. It is felt that 
Monroe is such a large trip attractor that intervening opportunities are not really ap- 
plicable. Most of the reservoirs planned by the state are more nearly the size of Cagles 
Mill and Mansfield; therefore, for predictive purposes, a model based on data from 
these 2 reservoirs should be more reliable than one that either includes Monroe data 
or is based on Monroe data alone. 

Monroe Model—The idea underlying the following analysis is that, if it can be shown 
that the parameter B; (for i = 1 to 3) does not vary significantly among the 3 years, it 
might be possible to derive a prediction equation (with a pooled estimate of parameter 
B) by extrapolating the parameter A to the design year. 
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The first step in the analysis was to test for homogeneity of variances of the trip- 
rate data during the 3-year period. This assumes that the regression equations were 
reasonable predictors to the data (which they were). Under these conditions, testing 
from homogeneity of variances in the data is approximately equivalent to testing for 
homogeneity of the error estimates of the regression equations. Homogeneity of the 
error estimates of the regression equations is necessary in order to test the signif- 
icance of By. 

Two tests were applied to the data: (a) Bartlett's test (7), in which a chi-square 
statistic is computed (assuming that there are normal populations), and (b) Foster- 
Burr's test (8), in which aQ-statistic is computed that is a monotone function of the 
coefficient of variation of the sample variances. The fact that the populations are not 
normal reduces the inferences possible from Bartlett's test; however, less research 
has been directed toward non-normal populations than to normal populations, so the 
test was applied bearing the limitations in mind. 

Both Bartlett's test and Foster-Burr's test produced highly significant statistics for 
the raw data, Y, and transformed data, In(Y + constant), and this led to the rejection 
of the hypothesis of equal population variances (the constant was added to enable the 
logarithmic transformation to be made). On the basis of this result, it was decided to 
delete from the data those counties having trip rates of less than 1.0. It was hoped that 
homogeneous variances would result from this action. This reduced the sample sizes 
from 64 for each year to 52, 46, 51 for 1967, 1968, 1969 respectively. The nonlinear 
regression program was rerun with the smaller data sets, and the parameters produced 
are given in Table 2. Most of the parameters have been only slightly reduced by ex- 
cluding trip rates less than 1.0. 

Bartlett's test and Foster-Burr's test were applied to these data; the chi-square 
statistic from Bartlett's test was 0.393, and the Q-statistic from Foster-Burr's test 
was 0.335, both of which are insignificant at an a-level of 0.01. In this case the hypoth- 
esis of homogeneity of variances cannot be rejected. 

It was then possible to test the hypothesis that the parameters B, are equal. The pro- 
cedure, explaned by Ostle (9), is to first test the hypothesis that all the observations can 
be described by 1 regression equation. If the F-statistic computed is significant (leading 
to the rejection of the hypothesis), the hypothesis of equal parameters B, can be tested 
by another F-test. F-values of 8.63 and 0.496 respectively were obtained from the 2 tests; 
thus, the hypothesis that all the observations can be described by 1 regression equation 
is rejected at an a-level of 0.25. 

The pooled estimate of parameter B for inclusion in the equation for each year was 
established when the nonlinear regression program was run for 1967, 1968, and 1969 
data combined. The value of B was calculated to be 0.558. As a last step, the non- 
linear regression program was rerun for each year, a regression line with parameter 
B = 0.558 was forced through the data, and parameter A was obtained in the equation 


Y - A e7 9-558 
The 3 equations that resulted were as follows: 
Y = 217 7°98 for 1967 
Y = 355 e7 °° for 1968 
Y = 634 e7 °° for 1969 


Figure 2 shows how parameter A varies from 1967 to 1969. The sharp increase that 
has occurred is a combination of the growth of Monroe in terms of facilities, reputation, 
and popularity and an increase in recreational trip-making in general. The former is 
by far the largest component of the growth. 

From the explanation given above, an extrapolation of the present trend of parameter 
A (line A) is likely to overestimate the design-year parameter A. What is more likely 
to happen is a leveling off as indicated by lines B, C, and D. Unless there is knowledge 
of other factors, however, there is no basis for choosing any one line over the others. 
It was, therefore, decided to use the value of parameter A as obtained from the 1969 
data and to acknowledge that it is a conservative estimator of the total annual trips to 
Monroe in some future year. 


Figure 1. Major reservoirs in Indiana. 
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Figure 2. Changes in parameter A in Monroe model 


from 1967 to 1975. 
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Table 2. Nonlinear regression parameters. 
Trip Monroe Closest Intervening 
Rates ———E 
<1.0 Year A B A B A B 
Included 1967 228 0.606 517 0.571 387 0.715 
1968 342 0.530 554 0.736 105 0,354 
1969 656 0.588 363 0.523 202 0.548 
Excluded 1967 222 0.576 516 0,570 243 0.453 
1968 340 0.525 520 0.638 107 0.305 
1969 648 0.575 362 0,621 127 0.511 


Year 


1967 
1968 
1969 


1967 
1968 
1969 


1967 
1968 
1969 


Total 

Attendance Estimated 
Expansion Total 
Factor Attendance 
0.814 42,713 
0.781 50,570 
0.760 43,149 
0.739 60,486 
0.808 63,592 
0.855 41,477 
0.915 39,269 
0.897 77,758 
0.870 108, 646 


County 
Trip 
Expansion 
Factors 


0.015 
0.012 
0.022 


0.018 
0.014 
0.030 


0.051 
0.012 
0.013 
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The choice of a 1969 equation as the prediction equation was not based on the fact 
that no significant differences were found between the B, parameters. The equation 
adopted as the Monroe model, 


vs 634 gr OneSeR 


however, does include a contribution from the data of each year (the pooled estimate of 
B), and so the previous analysis has not been ignored. The Monroe model is shown in 
Figure 3. 

New Reservoirs Model—The new reservoirs model is to consist of 2 equations (one 
from each of the closest and intervening groups of equations) that are considered to be 
the best for prediction purposes. 

Exactly the same procedure that was employed in the development of the Monroe 
model was employed to develop the equations for the new reservoirs model. The pa- 
rameters of the 3 equations for the initial run of the nonlinear regression program are 
given in Table 2. Once again, the variances of each year's data were not homogeneous 
until counties with trip rates of less than 1.0 were excluded from the analysis. The 
program was rerun, and the parameters of the 3 new equations obtained are also given 
in Table 2. The results from Bartlett's test and Foster-Burr's test for those data were 
such that the hypothesis of equal variances could not be rejected. 

The hypothesis of equal B, parameters for each year for closest and intervening was 
tested next, and in both cases it was found that the hypothesis could not be rejected. 
The data for each year were combined within closest and intervening, and the nonlinear 
regression program was rerun to find a pooled estimate of parameter B for each group. 
The pooled estimates of B were 0.573 and 0.407 for closest and intervening respectively. 
The result of forcing these B values through the data for each year is the following 
equations for closest: 


¥ = 520 e7°-578* for 1967 
¥ = 465 e °®™* for 1968 
Y = 398 e 9°73 for 1969 


and for intervening: 


Y = 212 e7°-*°* for 1967 
Y =151 e°°'*" for 1968 
Y = 136 e°°'*"* for 1969 


It is immediately apparent that parameter A is decreasing in both cases (while in the 
case of Monroe, parameter A was increasing yearly). To understand why this is the 
case requires that the location of Mansfield and Cagles Mill with respect to Monroe be 
considered. All 3 reservoirs are within 60 miles of each other; because of this, it would 
be naive to think that the attendance at Mansfield and Cagles Mill should remain unaf- 
fected during the growth period of Monroe. It is considered likely that this downward 
trend in parameter A is no more than a transient response to the appearance of Monroe 
and that it will not continue for more than a few years. For this reason and for the 
reason that the future recreational reservoirs (for which the new reservoirs model is 
intended) will not be close to such a large facility as Monroe, it was decided to use the 
equations that were developed from 1967 data for closest and intervening. 

The actual equations adopted to constitute the new reservoirs model are 


Y = 520 e °°”* for closest 


Y = 212 e~°*** for intervening 


Both equations, which are shown in Figures 4 and 5 respectively, use the pooled esti- 
mate of parameter B. 
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Trip-Making Characteristics 


Total Annual Trips—It is not enough for the planner to know how many trips (as pre- 
dicted by the new reservoirs model) will be made to a particular reservoir in any year. 
The additional information that he requires is the distribution of those trips during the 
year, the week, and the day so that he can provide for adequate park facilities, seasonal 
hiring of park staff, and easy and adequate access. Because the planner is interested 
in the maximum volumes, it is in terms of these that the following analysis is performed. 

Approximately 95 percent of all trips to a reservoir are made between the beginning 
of April and the end of September. This is based on the earlier study, for no out-of- 
season interviews were performed during this phase. The maximum volume week was 
determined for each reservoir for each year from official attendance figures, and the 
average ratio of maximum volume week to total annual trips was calculated to be ap- 
proximately 10 percent. 

In the earlier work, it was found that, on the average, 25 percent of all weekly trips 
arrived at the reservoir during the period from Monday through Friday morning, as- 
suming that weather conditions were similar. This means that, on the maximum volume 
weekend, 75 percent of the 10 percent of the total annual trips to the reservoir can be 
expected, which amounts to 7.5 percent. 

Approximately 50 percent of all weekend trips arrived on Sunday. It is, therefore, 
concluded that on the maximum volume weekend the reservoir attendance will amount 
to 7.5 percent of the total annual trips and that the highest daily volume (3.75 percent of 
the total annual trips) will occur on Sunday. 

Trip Distribution—A further breakdown may be made on the basis of hourly arrivals 
that were recorded for each reservoir in the initial phase. It can be seen that, on the 
average, 62 percent of all Sunday arrivals come in the 4-hour period between 11 a.m. 
and 3 p.m. This information can be used to calculate the capacity required on reservoir 
access roads. 

Besides the vehicular trips that can be expected on the maximum volume weekend, 
it is of importance to know how many people are associated with those trips. During 
this study, it was found that the average number of persons per trip was 3.75 and the 
average number of children per trip was 1.02. 

Figure 6 shows that 90 percent of the sample trips originated within the 125-mile 
radius adopted for this analysis. This median distance traveled is 52 miles, and the 
associated travel time is 62 minutes. 


RESULTS 


The primary objective of this phase was to evaluate the growth trends of recreational 
usage of multipurpose reservoirs with reference to the model developed earlier. The 
choice of an exponential model, Y = Ae **, to relate trip rates and distances in the 
earlier phase was substantiated by the data collected during this study. Three equations 
of the same form were developed. Of these, 2 equations (developed from data collected 
at Mansfield and Cagles Mill reservoirs) constituted the new reservoirs model. The 
third equation (the Monroe model) is to be used to predict annual trips to Monroe res- 
ervoir only. 

The 2 equations developed in the initial phase for the closest and intervening cate- 
gories were Y = 338 e °**”* and Y = 129 e~’**"* respectively. Comparing the equations 
from both phases shows that, although an increase in the value of parameter A (by fac- 
tors of 1.54 and 1.64 for the closest and intervening categories respectively) has oc- 
curred over time, there has been little change in the value of parameter B (almost none 
in the case of the closest counties). This is an important result, for it implies that a 
growth in the trip rates (which was being investigated in this phase) is best measured 
by changes in the value of parameter A. Furthermore, if continued study indicates even 
higher trip rates, only the parameter A in each of the 2 equations need be adjusted. It 
is not known by how much or in what manner parameter A of the 2 equations is likely to 
change during a period of 1 or 2 decades. The data collected in this recreational study 
rendered any prediction of the future behavior of parameter A unwise; only the fact that 
A did increase over time was observed. 


Figure 3. Annual trips to Monroe. 
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Figure 5. Annual trips to intervening park. 
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Figure 4. Annual trips to closest park. 


1000 


TRIPS PER THOUSAND POPULATION 
3 


DISTANCE {TENS OF MILES) 


Figure 6. Cumulative distribution of trips. 
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The single equation that constitutes the Monroe model is Y = 634e°°°". There is 
no way of comparing this equation with those of the new reservoirs model because in its 
development all counties within 125 miles of Monroe were used in the same category. 

No Monroe data were used in the initial phase, so there is no way to make a comparison 
between the 2 phases of the study. This equation is a conservative estimator for the 
total annual trips to Monroe because Monroe can still be considered to be in its initial 
growth period. 

It is concluded that the new reservoirs model, which is based on easily understood 
and readily obtainable variables (distance, population, and influence of similar facili- 
ties), is able to predict future attendance at new reservoirs with reasonable accuracy. 
In contrast to other previously developed models, which require many socioeconomic 
and park characteristics variables (often difficult to measure and evaluate and extremely 
difficult to project), the new reservoirs model is probably as accurate and much simpler 
touse. The new reservoirs model is adequate for advanced-planning purposes and can 
be used to predict reservoir attendance and traffic volume estimates. 

The objectives of this study were to check the previously developed models for sta- 
bility over time and to present a simplified procedure that could be easily implemented 
by the highway department. One can conclude that models of this type must be under 
constant surveillance, for the demand function is obviously changing. The simplified 
prediction procedure can be summarized in the following manner: 


1. Determine the location of the reservoir; 

2. Locate other similar recreational facilities; 

3. Determine the road distance (miles) to the reservoir from counties within 125 
miles; 

4. Obtain county population predictions for the design year; 

5. Determine which of the counties are closer to the reservoir under study than to 
any other similar facility; 

6. Determine the trip rates for each county closest to the reservoir (Fig. 4); 

7. Determine the trip rates for the remaining counties (Fig. 5); 

8. Calculate for each county the total annual trips by multiplying the trip rate by the 
population prediction; and 

9. Sum the total annual trips for all counties, and divide by 0.9 to account for trips 
originating farther than 125 miles away and to obtain the estimated total trips for the 
design year. 
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RELIABILITY ANALYSIS IN 
LAND USE-TRANSPORTATION PLANNING 


Kumares C. Sinha, Department of Civil Engineering, Marquette University 
ABRIDGMENT 


°®IN THE preparation of land use-transportation system plans for any area, the input 
data involve several demographic and economic variables. For example, the future 
transportation demand of an area can be predicted on the basis of variables such as 
population, automobile ownership, and employment. Once the estimates of the input 
variables for a future design year are prepared, the aggregate information about the 
transportation demand for that design year can be obtained, and the required trans- 
portation facilities can be planned accordingly. However, the estimation of each of the 
causal variables is associated with uncertainties. These uncertainties are associated 
to a certain extent with the theory and technique used in predictive models that deter- 
mine functional relations of the transportation demand with the causal variables. Similar 
statements can be made for most of the planning process, for land use-transportation 
planning entails a complex system acted onby numerous variables whose behavior isnot 
well understood. In the area of planning methodology, the emphasis has been thus far 
on the development of approximate descriptions of unpredictable, socioeconomic phe- 
nomena. In planning practice, however, the use of any analysis is to aid the planner in 
making decisions. These decisions might involve the planning and design of public fa- 
cilities such as transportation systems, sewerage and water supply systems, educa- 
tional facilities, and recreational areas. Because of the uncertainties associated with 
the predictions of future year demands of such systems, the planner is faced with a 
difficult task of choosing the most accurate design level of demands. For example, if 
the design demand is expressed in terms of population figures and if the estimated 
future population figures are associated with a wide range of uncertainty, no amount of 
sophistication and refinement of the planning models will cause any substantial improve- 
ment in the probability of success of the facilities planned on the basis of these models. 

For an efficient and effective planning process, a planner must make decisions as 
prudently as possible in the face of uncertainty. Before a particular element in plan 
design is included, the planner must search for the best design suited to the require- 
ment involving the probability of success of the particular design element. In most 
situations the planner will obviously depend on his judgment as well as his experience. 
However, Bayesian decision statistics provide an excellent tool that can aid the planner 
in arriving at an optimal decision by combining his subjective judgment with objective in- 
formation. 

A planned system may be defined as one that is conceived, designed, and implemented 
to a specified level of reliability. A level of reliability is a measure of assurance that 
the system planned will serve its intended function successfully. Planning reliability 
can then be defined as a quantitative measure that the system planned will achieve suc- 
cessfully some presented level of performance. 

In case of a transportation system plan, the component elements that constitute the 
plan design must be systematically examined in terms of their adequacy for the re- 
quired demand during their intended life. For example, one aspect of the planning reli- 
ability of a transit system can be conceived as the measure of success in the prediction 
of ridership demand. Because the design level of ridership demand will determine the 
extent of the facilities to be provided, it is essential that the associated reliability in 
ridership prediction be carefully examined. The reliability of a highway system plan 
must also be examined in terms of its adequacy for the intended period of its design life. 
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Both the future ridership demand for a proposed transit system and the expected traffic 
volume on a planned highway network can only be estimated as a prediction. Because of 
the nature of the forecasting process in transportation system planning, the adequacy of 
a plan design can only be examined in the face of uncertainty. In other words, the eval- 
uation of planning reliability is mostly concerned with the analysis of uncertainties involved 
with the forecasting process. With the application of a probability theory, a rational 
procedure can be developed to examine the extent of uncertainties involved in a particular 
demand forecast. Then, a decision theoretic approach can be adopted to determine the 
optimal level of a design parameter that is to be considered in systems planning. 

As an example, the problem of forecasting the total population in the 7 counties of 
the southeastern Wisconsin region was considered. The analysis was performed in 2 
steps. In the first step, the reliability of population projections obtained from the 
application of various techniques was tested. In the second step, estimates were pre- 
pared to evaluate the reliability of forecasting in a particular county. Subsequently, an 
optimal decision was made regarding the level of future design-year population for each 
county. The reliability of population projections was measured by the probability of 
predicted population figures falling within a given tolerance range. The reliability values 
for the different forecasting techniques considered ranged from 0.660 to 0.977 for a 
tolerance range of +5 percent. On the other hand, the reliability of forecasting for in- 
dividual counties ranged from 0.868 to 0.969 for the same tolerance range. The upper 
bound of the optimal size of the sample to be tested was found to be 6 for given prior in- 
formation and for assumed values of severity constant and unit sampling cost. 

The example discussed here is a simple one; in reality, however, the planning and 
design decisions are associated with multiple variables, and a reliability analysis of the 
entire system should be made by optimizing the combined expected utility. Futhermore, 
more reliable cost data are required to develop associated utility functions. These 
data would include the cost items involved with the overdesigns and underdesigns of a 
planned system such as a transportation facility. However, these cost data can be de- 
veloped with reasonable accuracy on the basis of the records maintained by the various 
public and private agencies. 

The procedure outlined in the paper provides an opportunity for a planner to arrive at 
an optimum level of design-year population. Such a procedure can be an extremely use- 
ful tool in the overall land use-transportation planning process. 
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DISTRIBUTION PATTERNS OF AUTOMOBILE TRAVEL 
IN THE NEW YORK METROPOLITAN AREA 


J. David Jordan, Tri-State Regional Planning Commission 


ABRIDGMENT 


®PLANS for highway improvements are dependent on traffic growth. Even aside from 
automobile ownership, this growth in traffic can be studied at the household, the basic 
trip-producing unit. To this end, home interview survey data for selected communities 
in the New York metropolitan area were processed to obtain the geographic distribution 
of vehicle-miles traveled in relation to the location of the households owning cars. Of 
particular interest were variations in the amount and distribution of travel due to the 
effect of density and supply of expressways. 


STUDY DESIGN 


Four areas were selected for analysis: 2 in New York City and 2 in the suburbs. 
Each was 4 square miles in size and was identified by the name of the community that 
covered most of its area. The areas were chosen in such a manner as to form 2 matched 
pairs. Within each pair, the 2 constituent communities had similar population densities 
and roughly equivalent transit service but differed in the supply of expressways. 


Community 
Density Range Express- 
Num- (persons/ way 
Area ber Name Sq mi) Access 

New York City 1 Flatbush, Kings County 35,000 Poor 
2 Fordham, Bronx County 40,000 Good 

Suburbs 3 Westwood, Bergen County, 
New Jersey 5,000 to 6,000 Poor 

Albertson, Nassau County, 
New York 5,000 to 6,000 Good 


Automobile driver trips originating from or terminating at households within each se- 
lected area were summarized according to their airline trip lengths from the area's 
geometrical center. The vehicle-miles of travel generated by these trips were also 
summarized according to distance from the center. 


RESULTS 


The suburban areas generated about twice as many automobile driver trips per auto- 
mobile as those in the city. Within each of the matched pairs, however, the differences 
in the supply of expressways had an insignificant effect on the travel generated per 
vehicle. 


Automobile Driver 
Community Trips/Automobile 


ul 2.1 
2 1.8 
3 4.4 
4 3.8 
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Because there was no Significant variation in average trip length among the 4 areas, 
the suburban areas also generated about twice as many vehicle-miles of travel per auto- 
mobile as the city pair, with variations in expressway supply causing negligible dif- 
ferences, 


Vehicle-Miles of 
Community Travel/Automobile 


1 7.9 
2 8.0 
3 17.1 
4 16.7 


The automobile driver trip distribution pattern displayed minor differences; the city 
areas had a larger percentage of trips in the 2- to 5-mile range, and the suburban areas 
had a higher percentage beyond 14 miles. This tendency was more apparent in the dis- 
tribution of vehicle-miles of travel where the suburban areas had about a quarter of 
their travel beyond 10 miles, compared to 11 to 16 percent for the city pair. 


VMT Within 5 VMT Beyond 10 
Miles of Area Miles From 


Centroid Area Centroid 
Community (percent) (percent) 
1 65.1 16.1 
2 65.7 11.2 
3 56.7 23.4 
4 §1.5 26.2 


Because the 2 pairs of locations represented extremes of density, a third pair of 
communities was selected for similar analysis. Both locations were in the intermediate 
density range of 15,000 to 20,000 persons/square mile, but one was better served by 
limited-access facilities. The amount of vehicle travel generated per automobile was 
quite similar for both and was between the values obtained earlier for the high density 
pair and the suburban pair. 


CONCLUSION 


An analysis of home interview survey travel data in 4 selected communities indicated 
that the supply of expressways had a minimal effect on the generation and distribution of 
automobile trips. Residents of the suburban communities generated about twice as many 
vehicle-miles per automobile as did those who lived in the denser urban areas, Gen- 
erally, as density increased, the week-day mileage generated per car decreased, no 
doubt because of an improved level of transit service and the greater costs associated 
with moving and parking automobiles. 
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THE National Academy of Sciences is a private, honorary organization of more than 800 
scientists and engineers elected on the basis of outstanding contributions to knowledge. 
Established by a congressional act of incorporation signed by Abraham Lincoln on March 
3, 1863, and supported by private and public funds, the Academy works to further 
science and its use for the general welfare by bringing together the most qualified indi- 
viduals to deal with scientific and technological problems of broad significance. 


Under the terms of its congressional charter, the Academy is also called upon to act as 
an official—yet independent—adyiser to the federal government in any matter of science 
and technology. This provision accounts for the close ties that have always existed be- 
tween the Academy and the government, although the Academy is not a governmental 
agency and its activities are not limited to those on behalf of the government. 


The National Academy of Engineering was established on December 5, 1964. On that 
date the Council of the National Academy of Sciences, under the authority of its act of 
incorporation, adopted articles of organization bringing the National Academy of Engi- 
neering into being, independent and autonomous in its organization and the election of 
its members, and closely coordinated with the National Academy of Sciences in its ad- 
visory activities. The two Academies join in the furtherance of science and engineering 
and share the responsibility of advising the federal government, upon request, on any sub- 
ject of science or technology. 


The National Research Council was organized as an agency of the National Academy 
of Sciences in 1916, at the request of President Wilson, to provide a broader participation 
by American scientists and engineers in the work of the Academy in service to science 
and the nation. Its members, who receive their appointments from the President of the 
National Academy of Sciences, are drawn from academic, industrial, and government 
organizations throughout the country. The National Research Council serves both 
Academies in the discharge of their responsibilities. Supported by private and public 
contributions, grants, and contracts and by voluntary contributions of time and effort by 
several thousand of the nation’s leading scientists and engineers, the Academies and their 
Research Council thus work to serve the national interest, to foster the sound develop- 
ment of science and engineering, and to promote their effective application for the bene- 
fit of society, 


The Division of Engineering is one of the eight major divisions into which the National 
Research Council is organized for the conduct of its work. Its membership includes rep- 
resentatives of the nation’s leading technical societies as well as a number of members-at- 
large. Its Chairman is appointed by the Council of the Academy of Sciences upon 
nomination by the Council of the Academy of Engineering. 


The Highway Research Board is an agency of the Division of Engineering. The 
Board was established November 11, 1920, under the auspices of the National Research 
Council as a cooperative organization of the highway technologists of America. The pur- 
pose of the Board is to adyance knowledge of the nature and performance of transporta- 
tion systems through the stimulation of research and dissemination of information de- 
rived therefrom. It is supported in this effort by the state highway departments, the U.S. 
Department of Transportation, and many other organizations interested in the develop- 
ment of transportation. 


HIGHWAY RESEARCH BOARD 
NATIONAL ACADEMY OF SCIENCES—NATIONAL RESEARCH COUNCIL 
2101 Constitution Avenue Washington, D. C. 20418 
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